Overview
Brought to you by YData
Dataset statistics
| Number of variables | 112 |
|---|---|
| Number of observations | 1926061 |
| Missing cells | 149451500 |
| Missing cells (%) | 69.3% |
| Total size in memory | 1.6 GiB |
| Average record size in memory | 896.0 B |
Variable types
| Text | 112 |
|---|
Dataset
| Description | Invertebrate Zoology NMNH Extant Specimen Records 0052489-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.fya67r |
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "IZ" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
associatedReferences has constant value "9" | Constant |
materialSampleID has constant value "North Pacific Ocean, Gulf Of California" | Constant |
higherGeographyID has constant value "24.1667" | Constant |
countryCode has constant value "24 10 00 N" | Constant |
verticalDatum has constant value "152" | Constant |
georeferencedBy has constant value "Idaho" | Constant |
latestAgeOrHighestStage has constant value "Moultrie" | Constant |
dateIdentified has constant value "-83.7685" | Constant |
originalNameUsage has constant value "GEOLocate" | Constant |
subfamily has constant value "47 57 0 N" | Constant |
tribe has constant value "129 4 0 W" | Constant |
subtribe has constant value "Seurat, L. G." | Constant |
cultivarEpithet has constant value "GEOLocate" | Constant |
nomenclaturalStatus has constant value "Camallanus seurati" | Constant |
recordNumber has 1804320 (93.7%) missing values | Missing |
recordedBy has 763966 (39.7%) missing values | Missing |
sex has 1744021 (90.5%) missing values | Missing |
lifeStage has 1837065 (95.4%) missing values | Missing |
occurrenceStatus has 1926059 (> 99.9%) missing values | Missing |
disposition has 1926059 (> 99.9%) missing values | Missing |
associatedMedia has 1672204 (86.8%) missing values | Missing |
associatedOccurrences has 1926059 (> 99.9%) missing values | Missing |
associatedReferences has 1926059 (> 99.9%) missing values | Missing |
associatedSequences has 1920937 (99.7%) missing values | Missing |
occurrenceRemarks has 1144278 (59.4%) missing values | Missing |
materialEntityRemarks has 1926059 (> 99.9%) missing values | Missing |
verbatimLabel has 1926059 (> 99.9%) missing values | Missing |
materialSampleID has 1926060 (> 99.9%) missing values | Missing |
eventType has 1926059 (> 99.9%) missing values | Missing |
fieldNumber has 1339537 (69.5%) missing values | Missing |
eventDate has 684431 (35.5%) missing values | Missing |
startDayOfYear has 772926 (40.1%) missing values | Missing |
endDayOfYear has 773095 (40.1%) missing values | Missing |
year has 684432 (35.5%) missing values | Missing |
month has 768070 (39.9%) missing values | Missing |
day has 841840 (43.7%) missing values | Missing |
verbatimEventDate has 1172997 (60.9%) missing values | Missing |
habitat has 1856817 (96.4%) missing values | Missing |
locationID has 983901 (51.1%) missing values | Missing |
higherGeographyID has 1926060 (> 99.9%) missing values | Missing |
higherGeography has 67820 (3.5%) missing values | Missing |
continent has 585602 (30.4%) missing values | Missing |
waterBody has 666547 (34.6%) missing values | Missing |
islandGroup has 1925291 (> 99.9%) missing values | Missing |
island has 1925083 (99.9%) missing values | Missing |
country has 141874 (7.4%) missing values | Missing |
countryCode has 1926060 (> 99.9%) missing values | Missing |
stateProvince has 943504 (49.0%) missing values | Missing |
county has 1786110 (92.7%) missing values | Missing |
locality has 642266 (33.3%) missing values | Missing |
minimumElevationInMeters has 1919257 (99.6%) missing values | Missing |
maximumElevationInMeters has 1922544 (99.8%) missing values | Missing |
verbatimElevation has 1925599 (> 99.9%) missing values | Missing |
verticalDatum has 1926060 (> 99.9%) missing values | Missing |
minimumDepthInMeters has 1143588 (59.4%) missing values | Missing |
maximumDepthInMeters has 1205034 (62.6%) missing values | Missing |
verbatimDepth has 1899821 (98.6%) missing values | Missing |
decimalLatitude has 927243 (48.1%) missing values | Missing |
decimalLongitude has 927246 (48.1%) missing values | Missing |
geodeticDatum has 1858158 (96.5%) missing values | Missing |
coordinatePrecision has 1926059 (> 99.9%) missing values | Missing |
pointRadiusSpatialFit has 1926059 (> 99.9%) missing values | Missing |
verbatimCoordinates has 1926059 (> 99.9%) missing values | Missing |
verbatimLatitude has 1854408 (96.3%) missing values | Missing |
verbatimLongitude has 1854475 (96.3%) missing values | Missing |
verbatimCoordinateSystem has 1246668 (64.7%) missing values | Missing |
footprintSRS has 1926059 (> 99.9%) missing values | Missing |
georeferencedBy has 1926060 (> 99.9%) missing values | Missing |
georeferenceProtocol has 1265567 (65.7%) missing values | Missing |
georeferenceSources has 1926058 (> 99.9%) missing values | Missing |
georeferenceRemarks has 1895791 (98.4%) missing values | Missing |
geologicalContextID has 1926058 (> 99.9%) missing values | Missing |
earliestEonOrLowestEonothem has 1926058 (> 99.9%) missing values | Missing |
latestEonOrHighestEonothem has 1926059 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 1926052 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 1926051 (> 99.9%) missing values | Missing |
latestPeriodOrHighestSystem has 1926054 (> 99.9%) missing values | Missing |
earliestEpochOrLowestSeries has 1926050 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 1926054 (> 99.9%) missing values | Missing |
earliestAgeOrLowestStage has 1926054 (> 99.9%) missing values | Missing |
latestAgeOrHighestStage has 1926060 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 1926059 (> 99.9%) missing values | Missing |
verbatimIdentification has 1926059 (> 99.9%) missing values | Missing |
identificationQualifier has 1907923 (99.1%) missing values | Missing |
typeStatus has 1838230 (95.4%) missing values | Missing |
identifiedBy has 1085026 (56.3%) missing values | Missing |
identifiedByID has 1926059 (> 99.9%) missing values | Missing |
dateIdentified has 1926060 (> 99.9%) missing values | Missing |
identificationReferences has 1926052 (> 99.9%) missing values | Missing |
identificationRemarks has 1926056 (> 99.9%) missing values | Missing |
scientificNameID has 1926059 (> 99.9%) missing values | Missing |
acceptedNameUsageID has 1926053 (> 99.9%) missing values | Missing |
nameAccordingToID has 1926059 (> 99.9%) missing values | Missing |
scientificName has 353701 (18.4%) missing values | Missing |
parentNameUsage has 1926059 (> 99.9%) missing values | Missing |
originalNameUsage has 1926060 (> 99.9%) missing values | Missing |
class has 76135 (4.0%) missing values | Missing |
order has 940799 (48.8%) missing values | Missing |
family has 191835 (10.0%) missing values | Missing |
subfamily has 1926060 (> 99.9%) missing values | Missing |
tribe has 1926060 (> 99.9%) missing values | Missing |
subtribe has 1926060 (> 99.9%) missing values | Missing |
genus has 353878 (18.4%) missing values | Missing |
subgenus has 1813329 (94.1%) missing values | Missing |
specificEpithet has 353916 (18.4%) missing values | Missing |
infraspecificEpithet has 1866911 (96.9%) missing values | Missing |
cultivarEpithet has 1926058 (> 99.9%) missing values | Missing |
taxonRank has 1866911 (96.9%) missing values | Missing |
scientificNameAuthorship has 756930 (39.3%) missing values | Missing |
nomenclaturalCode has 1926059 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 1926060 (> 99.9%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 16:46:22.038176 |
|---|---|
| Analysis finished | 2025-01-14 16:47:42.199936 |
| Duration | 1 minute and 20.16 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 1926061 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1926061 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1321728981 |
|---|---|
| 2nd row | 1320179422 |
| 3rd row | 1320179575 |
| 4th row | 1321729723 |
| 5th row | 1320179846 |
| Value | Count | Frequency (%) |
| 1321728981 | 1 | < 0.1% |
| 1320183643 | 1 | < 0.1% |
| 1321730497 | 1 | < 0.1% |
| 1320180949 | 1 | < 0.1% |
| 1320181165 | 1 | < 0.1% |
| 1456364805 | 1 | < 0.1% |
| 1320182209 | 1 | < 0.1% |
| 1321732097 | 1 | < 0.1% |
| 2571470239 | 1 | < 0.1% |
| 1320182449 | 1 | < 0.1% |
| Other values (1926051) | 1926051 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3940992 | |
| 3 | 2929721 | |
| 2 | 2443540 | |
| 7 | 1519647 | 7.9% |
| 8 | 1483597 | 7.7% |
| 0 | 1475792 | 7.7% |
| 9 | 1468721 | 7.6% |
| 5 | 1371139 | 7.1% |
| 6 | 1316858 | 6.8% |
| 4 | 1310603 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19260610 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3940992 | |
| 3 | 2929721 | |
| 2 | 2443540 | |
| 7 | 1519647 | 7.9% |
| 8 | 1483597 | 7.7% |
| 0 | 1475792 | 7.7% |
| 9 | 1468721 | 7.6% |
| 5 | 1371139 | 7.1% |
| 6 | 1316858 | 6.8% |
| 4 | 1310603 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19260610 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3940992 | |
| 3 | 2929721 | |
| 2 | 2443540 | |
| 7 | 1519647 | 7.9% |
| 8 | 1483597 | 7.7% |
| 0 | 1475792 | 7.7% |
| 9 | 1468721 | 7.6% |
| 5 | 1371139 | 7.1% |
| 6 | 1316858 | 6.8% |
| 4 | 1310603 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19260610 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3940992 | |
| 3 | 2929721 | |
| 2 | 2443540 | |
| 7 | 1519647 | 7.9% |
| 8 | 1483597 | 7.7% |
| 0 | 1475792 | 7.7% |
| 9 | 1468721 | 7.6% |
| 5 | 1371139 | 7.1% |
| 6 | 1316858 | 6.8% |
| 4 | 1310603 | 6.8% |
modified
Text
| Distinct | 113479 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 62369 ? |
|---|---|
| Unique (%) | 3.2% |
Sample
| 1st row | 2021-10-06 15:29:00 |
|---|---|
| 2nd row | 2024-09-25 16:08:00 |
| 3rd row | 2020-01-06 17:42:00 |
| 4th row | 2018-09-17 12:46:00 |
| 5th row | 2024-09-25 15:32:00 |
| Value | Count | Frequency (%) |
| 2024-09-25 | 692724 | 18.0% |
| 2018-09-17 | 227538 | 5.9% |
| 2019-11-01 | 80341 | 2.1% |
| 2021-10-06 | 56982 | 1.5% |
| 2014-10-08 | 33474 | 0.9% |
| 2014-10-09 | 25882 | 0.7% |
| 2017-03-29 | 25186 | 0.7% |
| 2013-01-10 | 21865 | 0.6% |
| 2024-08-19 | 19853 | 0.5% |
| 2014-10-20 | 17831 | 0.5% |
| Other values (3940) | 2650446 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8969856 | |
| 2 | 4987650 | |
| 1 | 4687977 | |
| - | 3852122 | |
| : | 3852122 | |
| 1926061 | 5.3% | |
| 4 | 1757416 | 4.8% |
| 5 | 1701788 | 4.7% |
| 9 | 1536715 | 4.2% |
| 3 | 1149662 | 3.1% |
| Other values (3) | 2173790 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26964854 | |
| Dash Punctuation | 3852122 | 10.5% |
| Other Punctuation | 3852122 | 10.5% |
| Space Separator | 1926061 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8969856 | |
| 2 | 4987650 | |
| 1 | 4687977 | |
| 4 | 1757416 | 6.5% |
| 5 | 1701788 | 6.3% |
| 9 | 1536715 | 5.7% |
| 3 | 1149662 | 4.3% |
| 7 | 807635 | 3.0% |
| 6 | 700968 | 2.6% |
| 8 | 665187 | 2.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3852122 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3852122 |
Space Separator
| Value | Count | Frequency (%) |
| 1926061 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 36595159 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 8969856 | |
| 2 | 4987650 | |
| 1 | 4687977 | |
| - | 3852122 | |
| : | 3852122 | |
| 1926061 | 5.3% | |
| 4 | 1757416 | 4.8% |
| 5 | 1701788 | 4.7% |
| 9 | 1536715 | 4.2% |
| 3 | 1149662 | 3.1% |
| Other values (3) | 2173790 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36595159 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8969856 | |
| 2 | 4987650 | |
| 1 | 4687977 | |
| - | 3852122 | |
| : | 3852122 | |
| 1926061 | 5.3% | |
| 4 | 1757416 | 4.8% |
| 5 | 1701788 | 4.7% |
| 9 | 1536715 | 4.2% |
| 3 | 1149662 | 3.1% |
| Other values (3) | 2173790 | 5.9% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 1926061 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 7704244 | |
| : | 7704244 | |
| l | 5778183 | 10.3% |
| i | 3852122 | 6.9% |
| r | 3852122 | 6.9% |
| c | 3852122 | 6.9% |
| g | 1926061 | 3.4% |
| 7 | 1926061 | 3.4% |
| 8 | 1926061 | 3.4% |
| 4 | 1926061 | 3.4% |
| Other values (8) | 15408488 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36595159 | |
| Other Punctuation | 9630305 | 17.2% |
| Decimal Number | 9630305 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7704244 | |
| l | 5778183 | |
| i | 3852122 | |
| r | 3852122 | |
| c | 3852122 | |
| g | 1926061 | 5.3% |
| u | 1926061 | 5.3% |
| b | 1926061 | 5.3% |
| d | 1926061 | 5.3% |
| s | 1926061 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 1926061 | |
| 8 | 1926061 | |
| 4 | 1926061 | |
| 3 | 1926061 | |
| 1 | 1926061 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 7704244 | |
| . | 1926061 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36595159 | |
| Common | 19260610 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 7704244 | |
| l | 5778183 | |
| i | 3852122 | |
| r | 3852122 | |
| c | 3852122 | |
| g | 1926061 | 5.3% |
| u | 1926061 | 5.3% |
| b | 1926061 | 5.3% |
| d | 1926061 | 5.3% |
| s | 1926061 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 7704244 | |
| 7 | 1926061 | 10.0% |
| 8 | 1926061 | 10.0% |
| 4 | 1926061 | 10.0% |
| 3 | 1926061 | 10.0% |
| . | 1926061 | 10.0% |
| 1 | 1926061 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55855769 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 7704244 | |
| : | 7704244 | |
| l | 5778183 | 10.3% |
| i | 3852122 | 6.9% |
| r | 3852122 | 6.9% |
| c | 3852122 | 6.9% |
| g | 1926061 | 3.4% |
| 7 | 1926061 | 3.4% |
| 8 | 1926061 | 3.4% |
| 4 | 1926061 | 3.4% |
| Other values (8) | 15408488 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
|---|---|
| 2nd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 3rd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 4th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 5th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| Value | Count | Frequency (%) |
| urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 | 1926061 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 9630305 | |
| 1 | 7704244 | 8.9% |
| - | 7704244 | 8.9% |
| u | 5778183 | 6.7% |
| 8 | 5778183 | 6.7% |
| 2 | 5778183 | 6.7% |
| 4 | 5778183 | 6.7% |
| c | 5778183 | 6.7% |
| f | 5778183 | 6.7% |
| 9 | 3852122 | 4.4% |
| Other values (9) | 23112732 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40447281 | |
| Decimal Number | 34669098 | |
| Dash Punctuation | 7704244 | 8.9% |
| Other Punctuation | 3852122 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 9630305 | |
| u | 5778183 | |
| c | 5778183 | |
| f | 5778183 | |
| b | 3852122 | 9.5% |
| r | 1926061 | 4.8% |
| i | 1926061 | 4.8% |
| a | 1926061 | 4.8% |
| n | 1926061 | 4.8% |
| e | 1926061 | 4.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7704244 | |
| 8 | 5778183 | |
| 2 | 5778183 | |
| 4 | 5778183 | |
| 9 | 3852122 | |
| 7 | 3852122 | |
| 6 | 1926061 | 5.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7704244 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3852122 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46225464 | |
| Latin | 40447281 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 9630305 | |
| u | 5778183 | |
| c | 5778183 | |
| f | 5778183 | |
| b | 3852122 | 9.5% |
| r | 1926061 | 4.8% |
| i | 1926061 | 4.8% |
| a | 1926061 | 4.8% |
| n | 1926061 | 4.8% |
| e | 1926061 | 4.8% |
Common
| Value | Count | Frequency (%) |
| 1 | 7704244 | |
| - | 7704244 | |
| 8 | 5778183 | |
| 2 | 5778183 | |
| 4 | 5778183 | |
| 9 | 3852122 | |
| : | 3852122 | |
| 7 | 3852122 | |
| 6 | 1926061 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86672745 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 9630305 | |
| 1 | 7704244 | 8.9% |
| - | 7704244 | 8.9% |
| u | 5778183 | 6.7% |
| 8 | 5778183 | 6.7% |
| 2 | 5778183 | 6.7% |
| 4 | 5778183 | 6.7% |
| c | 5778183 | 6.7% |
| f | 5778183 | 6.7% |
| 9 | 3852122 | 4.4% |
| Other values (9) | 23112732 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 1926061 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 1926061 | |
| S | 1926061 | |
| N | 1926061 | |
| M | 1926061 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7704244 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 1926061 | |
| S | 1926061 | |
| N | 1926061 | |
| M | 1926061 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7704244 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 1926061 | |
| S | 1926061 | |
| N | 1926061 | |
| M | 1926061 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7704244 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 1926061 | |
| S | 1926061 | |
| N | 1926061 | |
| M | 1926061 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | IZ |
|---|---|
| 2nd row | IZ |
| 3rd row | IZ |
| 4th row | IZ |
| 5th row | IZ |
| Value | Count | Frequency (%) |
| iz | 1926061 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1926061 | |
| Z | 1926061 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3852122 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1926061 | |
| Z | 1926061 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3852122 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 1926061 | |
| Z | 1926061 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3852122 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1926061 | |
| Z | 1926061 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 1926061 | |
| extant | 1926061 | |
| biology | 1926061 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 3852122 | 10.5% |
| 3852122 | 10.5% | |
| t | 3852122 | 10.5% |
| o | 3852122 | 10.5% |
| M | 1926061 | 5.3% |
| H | 1926061 | 5.3% |
| E | 1926061 | 5.3% |
| x | 1926061 | 5.3% |
| a | 1926061 | 5.3% |
| n | 1926061 | 5.3% |
| Other values (5) | 9630305 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21186671 | |
| Uppercase Letter | 11556366 | |
| Space Separator | 3852122 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3852122 | |
| o | 3852122 | |
| x | 1926061 | |
| a | 1926061 | |
| n | 1926061 | |
| i | 1926061 | |
| l | 1926061 | |
| g | 1926061 | |
| y | 1926061 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3852122 | |
| M | 1926061 | |
| H | 1926061 | |
| E | 1926061 | |
| B | 1926061 |
Space Separator
| Value | Count | Frequency (%) |
| 3852122 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32743037 | |
| Common | 3852122 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 3852122 | |
| t | 3852122 | |
| o | 3852122 | |
| M | 1926061 | 5.9% |
| H | 1926061 | 5.9% |
| E | 1926061 | 5.9% |
| x | 1926061 | 5.9% |
| a | 1926061 | 5.9% |
| n | 1926061 | 5.9% |
| B | 1926061 | 5.9% |
| Other values (4) | 7704244 |
Common
| Value | Count | Frequency (%) |
| 3852122 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36595159 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 3852122 | 10.5% |
| 3852122 | 10.5% | |
| t | 3852122 | 10.5% |
| o | 3852122 | 10.5% |
| M | 1926061 | 5.3% |
| H | 1926061 | 5.3% |
| E | 1926061 | 5.3% |
| x | 1926061 | 5.3% |
| a | 1926061 | 5.3% |
| n | 1926061 | 5.3% |
| Other values (5) | 9630305 |
basisOfRecord
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.00144025 |
| Min length | 16 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PreservedSpecimen |
|---|---|
| 2nd row | PreservedSpecimen |
| 3rd row | PreservedSpecimen |
| 4th row | PreservedSpecimen |
| 5th row | PreservedSpecimen |
| Value | Count | Frequency (%) |
| preservedspecimen | 1921925 | |
| machineobservation | 3455 | 0.2% |
| humanobservation | 681 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9617216 | |
| r | 3847986 | |
| n | 1930197 | 5.9% |
| i | 1929516 | 5.9% |
| s | 1926061 | 5.9% |
| v | 1926061 | 5.9% |
| c | 1925380 | 5.9% |
| m | 1922606 | 5.9% |
| P | 1921925 | 5.9% |
| p | 1921925 | 5.9% |
| Other values (11) | 3876938 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28893689 | |
| Uppercase Letter | 3852122 | 11.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9617216 | |
| r | 3847986 | |
| n | 1930197 | 6.7% |
| i | 1929516 | 6.7% |
| s | 1926061 | 6.7% |
| v | 1926061 | 6.7% |
| c | 1925380 | 6.7% |
| m | 1922606 | 6.7% |
| p | 1921925 | 6.7% |
| d | 1921925 | 6.7% |
| Other values (6) | 24816 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1921925 | |
| S | 1921925 | |
| O | 4136 | 0.1% |
| M | 3455 | 0.1% |
| H | 681 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32745811 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9617216 | |
| r | 3847986 | |
| n | 1930197 | 5.9% |
| i | 1929516 | 5.9% |
| s | 1926061 | 5.9% |
| v | 1926061 | 5.9% |
| c | 1925380 | 5.9% |
| m | 1922606 | 5.9% |
| P | 1921925 | 5.9% |
| p | 1921925 | 5.9% |
| Other values (11) | 3876938 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32745811 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 9617216 | |
| r | 3847986 | |
| n | 1930197 | 5.9% |
| i | 1929516 | 5.9% |
| s | 1926061 | 5.9% |
| v | 1926061 | 5.9% |
| c | 1925380 | 5.9% |
| m | 1922606 | 5.9% |
| P | 1921925 | 5.9% |
| p | 1921925 | 5.9% |
| Other values (11) | 3876938 |
occurrenceID
Text
Unique 
| Distinct | 1926061 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 1926061 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/3c831e8df-8799-47a1-8dcf-bcb0b77fd3e3 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/383ab647e-23a7-4086-b71e-36212ccc0eb2 |
| 3rd row | http://n2t.net/ark:/65665/383adbf6e-f769-4dc3-8bef-550530af49ee |
| 4th row | http://n2t.net/ark:/65665/3c83aad38-c935-46fa-96c3-e450ebb169cf |
| 5th row | http://n2t.net/ark:/65665/383b126a6-bf3a-4908-bc33-e4435555fcc5 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/3c831e8df-8799-47a1-8dcf-bcb0b77fd3e3 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383db58fb-5d8c-4076-bec7-fa6e28ed98a7 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c843fd56-7874-4858-b938-14fdfcb5544c | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383bcb698-5477-4feb-9966-d9adae345f09 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383bfd766-40bc-4ede-82ca-0df3775130f3 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c84cf22c-2b9b-49fb-91ed-f85efd9e9fa7 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383cb8e2a-4f46-4138-82be-3d7989851c9e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c856104b-9825-44b9-8b57-e69b58510bf8 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c856ef4e-b135-45c8-8511-c533777f0d7a | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383ce04ed-5cd8-4a05-90df-39eccc31a990 | 1 | < 0.1% |
| Other values (1926051) | 1926051 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 9630305 | 7.9% |
| 6 | 9392699 | 7.7% |
| - | 7704244 | 6.3% |
| t | 7704244 | 6.3% |
| 5 | 7459909 | 6.1% |
| a | 6017570 | 5.0% |
| 3 | 5538537 | 4.6% |
| e | 5536681 | 4.6% |
| 2 | 5536450 | 4.6% |
| 4 | 5533592 | 4.6% |
| Other values (16) | 51287612 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52484647 | |
| Lowercase Letter | 45744464 | |
| Other Punctuation | 15408488 | 12.7% |
| Dash Punctuation | 7704244 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 7704244 | |
| a | 6017570 | |
| e | 5536681 | |
| b | 4094745 | |
| n | 3852122 | |
| d | 3614893 | |
| c | 3611051 | |
| f | 3608914 | |
| k | 1926061 | 4.2% |
| r | 1926061 | 4.2% |
| Other values (2) | 3852122 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 9392699 | |
| 5 | 7459909 | |
| 3 | 5538537 | |
| 2 | 5536450 | |
| 4 | 5533592 | |
| 8 | 4094794 | |
| 9 | 4094577 | |
| 1 | 3613193 | 6.9% |
| 7 | 3610777 | 6.9% |
| 0 | 3610119 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 9630305 | |
| : | 3852122 | 25.0% |
| . | 1926061 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7704244 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 75597379 | |
| Latin | 45744464 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 9630305 | |
| 6 | 9392699 | |
| - | 7704244 | |
| 5 | 7459909 | |
| 3 | 5538537 | |
| 2 | 5536450 | |
| 4 | 5533592 | |
| 8 | 4094794 | 5.4% |
| 9 | 4094577 | 5.4% |
| : | 3852122 | 5.1% |
| Other values (4) | 12760150 |
Latin
| Value | Count | Frequency (%) |
| t | 7704244 | |
| a | 6017570 | |
| e | 5536681 | |
| b | 4094745 | |
| n | 3852122 | |
| d | 3614893 | |
| c | 3611051 | |
| f | 3608914 | |
| k | 1926061 | 4.2% |
| r | 1926061 | 4.2% |
| Other values (2) | 3852122 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 121341843 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 9630305 | 7.9% |
| 6 | 9392699 | 7.7% |
| - | 7704244 | 6.3% |
| t | 7704244 | 6.3% |
| 5 | 7459909 | 6.1% |
| a | 6017570 | 5.0% |
| 3 | 5538537 | 4.6% |
| e | 5536681 | 4.6% |
| 2 | 5536450 | 4.6% |
| 4 | 5533592 | 4.6% |
| Other values (16) | 51287612 |
catalogNumber
Text
| Distinct | 1355224 |
|---|---|
| Distinct (%) | 70.4% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 11.0374122 |
| Min length | 6 |
Unique
| Unique | 1024370 ? |
|---|---|
| Unique (%) | 53.2% |
Sample
| 1st row | USNM 1119015 |
|---|---|
| 2nd row | USNM 55168 |
| 3rd row | USNM 52536 |
| 4th row | USNM E40844 |
| 5th row | USNM 241160 |
| Value | Count | Frequency (%) |
| usnm | 1926056 | |
| 31 | < 0.1% | |
| 284908 | 16 | < 0.1% |
| 653324 | 13 | < 0.1% |
| 5357 | 11 | < 0.1% |
| 15490 | 10 | < 0.1% |
| 859036 | 10 | < 0.1% |
| 224878 | 10 | < 0.1% |
| 22869 | 10 | < 0.1% |
| 284377 | 9 | < 0.1% |
| Other values (1351980) | 1925969 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1928174 | 9.1% |
| U | 1926163 | 9.1% |
| 1926089 | 9.1% | |
| S | 1926056 | 9.1% |
| N | 1926056 | 9.1% |
| 1 | 1809561 | 8.5% |
| 2 | 1247347 | 5.9% |
| 3 | 1147683 | 5.4% |
| 4 | 1110632 | 5.2% |
| 5 | 1088174 | 5.1% |
| Other values (53) | 5222739 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11555577 | |
| Uppercase Letter | 7762060 | |
| Space Separator | 1926089 | 9.1% |
| Lowercase Letter | 11685 | 0.1% |
| Other Punctuation | 3259 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8272 | |
| b | 1738 | 14.9% |
| c | 637 | 5.5% |
| d | 326 | 2.8% |
| e | 206 | 1.8% |
| f | 143 | 1.2% |
| g | 87 | 0.7% |
| h | 61 | 0.5% |
| i | 40 | 0.3% |
| j | 35 | 0.3% |
| Other values (16) | 140 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1928174 | |
| U | 1926163 | |
| S | 1926056 | |
| N | 1926056 | |
| E | 53442 | 0.7% |
| I | 778 | < 0.1% |
| A | 697 | < 0.1% |
| X | 326 | < 0.1% |
| B | 177 | < 0.1% |
| D | 128 | < 0.1% |
| Other values (10) | 63 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1809561 | |
| 2 | 1247347 | |
| 3 | 1147683 | |
| 4 | 1110632 | |
| 5 | 1088174 | |
| 8 | 1073263 | |
| 6 | 1062173 | |
| 7 | 1058767 | |
| 0 | 1001934 | |
| 9 | 956043 |
Other Punctuation
| Value | Count | Frequency (%) |
| * | 3252 | |
| . | 6 | 0.2% |
| & | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1926089 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13484929 | |
| Latin | 7773745 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1928174 | |
| U | 1926163 | |
| S | 1926056 | |
| N | 1926056 | |
| E | 53442 | 0.7% |
| a | 8272 | 0.1% |
| b | 1738 | < 0.1% |
| I | 778 | < 0.1% |
| A | 697 | < 0.1% |
| c | 637 | < 0.1% |
| Other values (36) | 1732 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 1926089 | ||
| 1 | 1809561 | |
| 2 | 1247347 | |
| 3 | 1147683 | |
| 4 | 1110632 | |
| 5 | 1088174 | |
| 8 | 1073263 | |
| 6 | 1062173 | |
| 7 | 1058767 | |
| 0 | 1001934 | |
| Other values (7) | 959306 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21258674 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1928174 | 9.1% |
| U | 1926163 | 9.1% |
| 1926089 | 9.1% | |
| S | 1926056 | 9.1% |
| N | 1926056 | 9.1% |
| 1 | 1809561 | 8.5% |
| 2 | 1247347 | 5.9% |
| 3 | 1147683 | 5.4% |
| 4 | 1110632 | 5.2% |
| 5 | 1088174 | 5.1% |
| Other values (53) | 5222739 |
recordNumber
Text
Missing 
| Distinct | 119483 |
|---|---|
| Distinct (%) | 98.1% |
| Missing | 1804320 |
| Missing (%) | 93.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 14 |
| Mean length | 13.17353234 |
| Min length | 1 |
Unique
| Unique | 118854 ? |
|---|---|
| Unique (%) | 97.6% |
Sample
| 1st row | USNPC # 001298 |
|---|---|
| 2nd row | FPlrv_430 |
| 3rd row | H-2284 |
| 4th row | USNPC # 066527 |
| 5th row | USNPC # 009815 |
| Value | Count | Frequency (%) |
| 88136 | ||
| usnpc | 88055 | |
| ullz | 5209 | 1.7% |
| rh | 1566 | 0.5% |
| k-rh | 1554 | 0.5% |
| ce16007-event | 223 | 0.1% |
| 2208 | 102 | < 0.1% |
| 1430 | 92 | < 0.1% |
| 1513 | 80 | < 0.1% |
| beauty | 75 | < 0.1% |
| Other values (119402) | 122305 |
Most occurring characters
| Value | Count | Frequency (%) |
| 185656 | 11.6% | |
| 0 | 161160 | 10.0% |
| C | 97548 | 6.1% |
| S | 95220 | 5.9% |
| U | 94859 | 5.9% |
| P | 94137 | 5.9% |
| N | 93444 | 5.8% |
| # | 88212 | 5.5% |
| 1 | 82997 | 5.2% |
| 2 | 65144 | 4.1% |
| Other values (71) | 545382 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 709617 | |
| Uppercase Letter | 576479 | |
| Space Separator | 185656 | 11.6% |
| Other Punctuation | 91627 | 5.7% |
| Dash Punctuation | 15239 | 1.0% |
| Connector Punctuation | 14089 | 0.9% |
| Lowercase Letter | 10490 | 0.7% |
| Close Punctuation | 281 | < 0.1% |
| Open Punctuation | 271 | < 0.1% |
| Math Symbol | 10 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 97548 | |
| S | 95220 | |
| U | 94859 | |
| P | 94137 | |
| N | 93444 | |
| L | 12317 | 2.1% |
| E | 11806 | 2.0% |
| R | 10315 | 1.8% |
| I | 7528 | 1.3% |
| B | 7241 | 1.3% |
| Other values (16) | 52064 |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1416 | |
| v | 1363 | |
| a | 1349 | |
| r | 1268 | |
| t | 873 | |
| e | 713 | |
| s | 657 | 6.3% |
| n | 489 | 4.7% |
| c | 300 | 2.9% |
| i | 287 | 2.7% |
| Other values (16) | 1775 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 161160 | |
| 1 | 82997 | |
| 2 | 65144 | |
| 6 | 58922 | 8.3% |
| 3 | 58881 | 8.3% |
| 7 | 58482 | 8.2% |
| 4 | 56680 | 8.0% |
| 8 | 56221 | 7.9% |
| 9 | 55917 | 7.9% |
| 5 | 55213 | 7.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| # | 88212 | |
| . | 2351 | 2.6% |
| : | 559 | 0.6% |
| , | 400 | 0.4% |
| ; | 65 | 0.1% |
| / | 20 | < 0.1% |
| & | 10 | < 0.1% |
| ? | 7 | < 0.1% |
| * | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15238 | |
| – | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 273 | |
| ] | 8 | 2.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 263 | |
| [ | 8 | 3.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 | |
| = | 5 |
Space Separator
| Value | Count | Frequency (%) |
| 185656 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 14089 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1016790 | |
| Latin | 586969 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 97548 | |
| S | 95220 | |
| U | 94859 | |
| P | 94137 | |
| N | 93444 | |
| L | 12317 | 2.1% |
| E | 11806 | 2.0% |
| R | 10315 | 1.8% |
| I | 7528 | 1.3% |
| B | 7241 | 1.2% |
| Other values (42) | 62554 |
Common
| Value | Count | Frequency (%) |
| 185656 | ||
| 0 | 161160 | |
| # | 88212 | |
| 1 | 82997 | |
| 2 | 65144 | 6.4% |
| 6 | 58922 | 5.8% |
| 3 | 58881 | 5.8% |
| 7 | 58482 | 5.8% |
| 4 | 56680 | 5.6% |
| 8 | 56221 | 5.5% |
| Other values (19) | 144435 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1603758 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 185656 | 11.6% | |
| 0 | 161160 | 10.0% |
| C | 97548 | 6.1% |
| S | 95220 | 5.9% |
| U | 94859 | 5.9% |
| P | 94137 | 5.9% |
| N | 93444 | 5.8% |
| # | 88212 | 5.5% |
| 1 | 82997 | 5.2% |
| 2 | 65144 | 4.1% |
| Other values (70) | 545381 |
Punctuation
| Value | Count | Frequency (%) |
| – | 1 |
recordedBy
Text
Missing 
| Distinct | 37538 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 763966 |
| Missing (%) | 39.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 15905 |
|---|---|
| Median length | 156 |
| Mean length | 23.04389228 |
| Min length | 1 |
Unique
| Unique | 16586 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | VIMS for BLM/ MMS |
|---|---|
| 2nd row | Lgl Ecological Research Associates/ Environmental Science And Engineering For BLM/ MMS |
| 3rd row | University of Southern California |
| 4th row | United States Fish Commission |
| 5th row | United States Fish Commission |
| Value | Count | Frequency (%) |
| mms | 180985 | 4.2% |
| blm | 180983 | 4.2% |
| for | 178027 | 4.2% |
| fish | 168335 | 3.9% |
| united | 164134 | 3.8% |
| states | 163470 | 3.8% |
| commission | 157053 | 3.7% |
| 149555 | 3.5% | |
| of | 101735 | 2.4% |
| j | 101445 | 2.4% |
| Other values (19699) | 2735993 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3118394 | 11.6% | |
| e | 2082026 | 7.8% |
| i | 1878666 | 7.0% |
| n | 1615750 | 6.0% |
| t | 1592137 | 5.9% |
| o | 1549174 | 5.8% |
| s | 1529432 | 5.7% |
| a | 1498632 | 5.6% |
| r | 1220911 | 4.6% |
| M | 808430 | 3.0% |
| Other values (89) | 9885640 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17586035 | |
| Uppercase Letter | 4861930 | 18.2% |
| Space Separator | 3118394 | 11.6% |
| Other Punctuation | 1144505 | 4.3% |
| Dash Punctuation | 53401 | 0.2% |
| Decimal Number | 6866 | < 0.1% |
| Control | 6698 | < 0.1% |
| Open Punctuation | 669 | < 0.1% |
| Close Punctuation | 669 | < 0.1% |
| Math Symbol | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2082026 | |
| i | 1878666 | |
| n | 1615750 | |
| t | 1592137 | |
| o | 1549174 | |
| s | 1529432 | |
| a | 1498632 | |
| r | 1220911 | 6.9% |
| l | 767614 | 4.4% |
| h | 563615 | 3.2% |
| Other values (31) | 3288078 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 808430 | |
| S | 653530 | |
| B | 397798 | 8.2% |
| C | 364653 | 7.5% |
| F | 349172 | 7.2% |
| L | 335721 | 6.9% |
| U | 267026 | 5.5% |
| H | 212474 | 4.4% |
| R | 188625 | 3.9% |
| W | 154318 | 3.2% |
| Other values (17) | 1130183 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 741714 | |
| / | 238270 | 20.8% |
| & | 118056 | 10.3% |
| , | 45635 | 4.0% |
| ' | 383 | < 0.1% |
| : | 366 | < 0.1% |
| " | 36 | < 0.1% |
| ; | 26 | < 0.1% |
| ? | 15 | < 0.1% |
| # | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1289 | |
| 2 | 1273 | |
| 0 | 1071 | |
| 9 | 974 | |
| 4 | 474 | 6.9% |
| 6 | 442 | 6.4% |
| 8 | 366 | 5.3% |
| 3 | 348 | 5.1% |
| 7 | 334 | 4.9% |
| 5 | 295 | 4.3% |
Control
| Value | Count | Frequency (%) |
| 6663 | ||
| 35 | 0.5% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 667 | |
| { | 2 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 667 | |
| } | 2 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 21 | |
| = | 4 | 16.0% |
Space Separator
| Value | Count | Frequency (%) |
| 3118394 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 53401 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22447965 | |
| Common | 4331227 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2082026 | 9.3% |
| i | 1878666 | 8.4% |
| n | 1615750 | 7.2% |
| t | 1592137 | 7.1% |
| o | 1549174 | 6.9% |
| s | 1529432 | 6.8% |
| a | 1498632 | 6.7% |
| r | 1220911 | 5.4% |
| M | 808430 | 3.6% |
| l | 767614 | 3.4% |
| Other values (58) | 7905193 |
Common
| Value | Count | Frequency (%) |
| 3118394 | ||
| . | 741714 | 17.1% |
| / | 238270 | 5.5% |
| & | 118056 | 2.7% |
| - | 53401 | 1.2% |
| , | 45635 | 1.1% |
| 6663 | 0.2% | |
| 1 | 1289 | < 0.1% |
| 2 | 1273 | < 0.1% |
| 0 | 1071 | < 0.1% |
| Other values (21) | 5461 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26778271 | |
| None | 921 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3118394 | 11.6% | |
| e | 2082026 | 7.8% |
| i | 1878666 | 7.0% |
| n | 1615750 | 6.0% |
| t | 1592137 | 5.9% |
| o | 1549174 | 5.8% |
| s | 1529432 | 5.7% |
| a | 1498632 | 5.6% |
| r | 1220911 | 4.6% |
| M | 808430 | 3.0% |
| Other values (73) | 9884719 |
None
| Value | Count | Frequency (%) |
| é | 455 | |
| ü | 102 | 11.1% |
| á | 93 | 10.1% |
| ö | 65 | 7.1% |
| ä | 57 | 6.2% |
| ó | 53 | 5.8% |
| í | 49 | 5.3% |
| è | 14 | 1.5% |
| ñ | 12 | 1.3% |
| ç | 9 | 1.0% |
| Other values (6) | 12 | 1.3% |
individualCount
Text
| Distinct | 1067 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 156 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.10839112 |
| Min length | 1 |
Unique
| Unique | 413 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 11 |
| 3rd row | 1 |
| 4th row | 26 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 995615 | |
| 2 | 289522 | 15.0% |
| 3 | 135746 | 7.0% |
| 4 | 99091 | 5.1% |
| 5 | 73915 | 3.8% |
| 6 | 51736 | 2.7% |
| 10 | 38942 | 2.0% |
| 7 | 31367 | 1.6% |
| 8 | 30163 | 1.6% |
| 9 | 18498 | 1.0% |
| Other values (1057) | 161310 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1131407 | |
| 2 | 345437 | 16.2% |
| 3 | 162113 | 7.6% |
| 4 | 118945 | 5.6% |
| 5 | 110267 | 5.2% |
| 0 | 93489 | 4.4% |
| 6 | 64558 | 3.0% |
| 7 | 42168 | 2.0% |
| 8 | 40048 | 1.9% |
| 9 | 26224 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2134656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1131407 | |
| 2 | 345437 | 16.2% |
| 3 | 162113 | 7.6% |
| 4 | 118945 | 5.6% |
| 5 | 110267 | 5.2% |
| 0 | 93489 | 4.4% |
| 6 | 64558 | 3.0% |
| 7 | 42168 | 2.0% |
| 8 | 40048 | 1.9% |
| 9 | 26224 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2134656 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1131407 | |
| 2 | 345437 | 16.2% |
| 3 | 162113 | 7.6% |
| 4 | 118945 | 5.6% |
| 5 | 110267 | 5.2% |
| 0 | 93489 | 4.4% |
| 6 | 64558 | 3.0% |
| 7 | 42168 | 2.0% |
| 8 | 40048 | 1.9% |
| 9 | 26224 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2134656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1131407 | |
| 2 | 345437 | 16.2% |
| 3 | 162113 | 7.6% |
| 4 | 118945 | 5.6% |
| 5 | 110267 | 5.2% |
| 0 | 93489 | 4.4% |
| 6 | 64558 | 3.0% |
| 7 | 42168 | 2.0% |
| 8 | 40048 | 1.9% |
| 9 | 26224 | 1.2% |
sex
Text
Missing 
| Distinct | 299 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1744021 |
| Missing (%) | 90.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 130 |
|---|---|
| Median length | 76 |
| Mean length | 8.258635465 |
| Min length | 4 |
Unique
| Unique | 137 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | female |
|---|---|
| 2nd row | female |
| 3rd row | male; female |
| 4th row | male |
| 5th row | male |
| Value | Count | Frequency (%) |
| female | 137569 | |
| male | 121519 | |
| unknown | 1423 | 0.5% |
| hermaphrodite | 267 | 0.1% |
| 224 | 0.1% | |
| intersex | 146 | 0.1% |
| male/female | 101 | < 0.1% |
| female/male | 9 | < 0.1% |
| neuter | 1 | < 0.1% |
| imposex | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 397816 | |
| a | 259575 | |
| l | 259308 | |
| m | 253777 | |
| f | 128855 | 8.6% |
| ; | 96869 | 6.4% |
| 79220 | 5.3% | |
| F | 8824 | 0.6% |
| M | 5799 | 0.4% |
| n | 4416 | 0.3% |
| Other values (15) | 8943 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1311268 | |
| Other Punctuation | 96979 | 6.5% |
| Space Separator | 79220 | 5.3% |
| Uppercase Letter | 15935 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 397816 | |
| a | 259575 | |
| l | 259308 | |
| m | 253777 | |
| f | 128855 | 9.8% |
| n | 4416 | 0.3% |
| o | 1691 | 0.1% |
| k | 1423 | 0.1% |
| w | 1423 | 0.1% |
| r | 681 | 0.1% |
| Other values (8) | 2303 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 8824 | |
| M | 5799 | |
| U | 1306 | 8.2% |
| I | 6 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 96869 | |
| / | 110 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 79220 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1327203 | |
| Common | 176199 | 11.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 397816 | |
| a | 259575 | |
| l | 259308 | |
| m | 253777 | |
| f | 128855 | 9.7% |
| F | 8824 | 0.7% |
| M | 5799 | 0.4% |
| n | 4416 | 0.3% |
| o | 1691 | 0.1% |
| k | 1423 | 0.1% |
| Other values (12) | 5719 | 0.4% |
Common
| Value | Count | Frequency (%) |
| ; | 96869 | |
| 79220 | ||
| / | 110 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1503402 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 397816 | |
| a | 259575 | |
| l | 259308 | |
| m | 253777 | |
| f | 128855 | 8.6% |
| ; | 96869 | 6.4% |
| 79220 | 5.3% | |
| F | 8824 | 0.6% |
| M | 5799 | 0.4% |
| n | 4416 | 0.3% |
| Other values (15) | 8943 | 0.6% |
lifeStage
Text
Missing 
| Distinct | 852 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 1837065 |
| Missing (%) | 95.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 97 |
|---|---|
| Median length | 76 |
| Mean length | 9.342240101 |
| Min length | 1 |
Unique
| Unique | 377 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | ovigerous |
|---|---|
| 2nd row | I |
| 3rd row | larva |
| 4th row | juvenile |
| 5th row | larvae |
| Value | Count | Frequency (%) |
| juvenile | 43771 | |
| 16544 | 12.7% | |
| ovigerous | 15621 | 12.0% |
| adult | 15324 | 11.7% |
| ii | 11920 | 9.1% |
| i | 9497 | 7.3% |
| larvae | 7056 | 5.4% |
| immature | 1741 | 1.3% |
| larva | 1318 | 1.0% |
| copepodid | 666 | 0.5% |
| Other values (173) | 7154 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 117680 | |
| u | 78513 | |
| l | 69686 | 8.4% |
| ; | 68834 | 8.3% |
| v | 68052 | 8.2% |
| i | 64284 | 7.7% |
| n | 45375 | 5.5% |
| j | 43457 | 5.2% |
| 41616 | 5.0% | |
| a | 40329 | 4.9% |
| Other values (40) | 193596 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 685260 | |
| Other Punctuation | 68876 | 8.3% |
| Space Separator | 41616 | 5.0% |
| Uppercase Letter | 35133 | 4.2% |
| Dash Punctuation | 292 | < 0.1% |
| Decimal Number | 245 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 117680 | |
| u | 78513 | |
| l | 69686 | |
| v | 68052 | |
| i | 64284 | |
| n | 45375 | 6.6% |
| j | 43457 | 6.3% |
| a | 40329 | 5.9% |
| o | 36937 | 5.4% |
| r | 29747 | 4.3% |
| Other values (16) | 91200 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 33827 | |
| V | 527 | 1.5% |
| J | 356 | 1.0% |
| A | 302 | 0.9% |
| C | 45 | 0.1% |
| X | 22 | 0.1% |
| L | 21 | 0.1% |
| M | 19 | 0.1% |
| P | 14 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 68834 | |
| ' | 19 | < 0.1% |
| & | 13 | < 0.1% |
| . | 4 | < 0.1% |
| , | 4 | < 0.1% |
| / | 1 | < 0.1% |
| ? | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 116 | |
| 2 | 62 | |
| 3 | 34 | 13.9% |
| 4 | 24 | 9.8% |
| 5 | 7 | 2.9% |
| 6 | 2 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 41616 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 292 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 720393 | |
| Common | 111029 | 13.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 117680 | |
| u | 78513 | |
| l | 69686 | |
| v | 68052 | |
| i | 64284 | |
| n | 45375 | 6.3% |
| j | 43457 | 6.0% |
| a | 40329 | 5.6% |
| o | 36937 | 5.1% |
| I | 33827 | 4.7% |
| Other values (25) | 122253 |
Common
| Value | Count | Frequency (%) |
| ; | 68834 | |
| 41616 | ||
| - | 292 | 0.3% |
| 1 | 116 | 0.1% |
| 2 | 62 | 0.1% |
| 3 | 34 | < 0.1% |
| 4 | 24 | < 0.1% |
| ' | 19 | < 0.1% |
| & | 13 | < 0.1% |
| 5 | 7 | < 0.1% |
| Other values (5) | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 831403 | |
| None | 19 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 117680 | |
| u | 78513 | |
| l | 69686 | 8.4% |
| ; | 68834 | 8.3% |
| v | 68052 | 8.2% |
| i | 64284 | 7.7% |
| n | 45375 | 5.5% |
| j | 43457 | 5.2% |
| 41616 | 5.0% | |
| a | 40329 | 4.9% |
| Other values (39) | 193577 |
None
| Value | Count | Frequency (%) |
| ü | 19 |
occurrenceStatus
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1993-09-09 |
|---|---|
| 2nd row | 1938-09-22 |
| Value | Count | Frequency (%) |
| 1993-09-09 | 1 | |
| 1938-09-22 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 6 | |
| - | 4 | |
| 0 | 3 | |
| 1 | 2 | 10.0% |
| 3 | 2 | 10.0% |
| 2 | 2 | 10.0% |
| 8 | 1 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16 | |
| Dash Punctuation | 4 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 6 | |
| 0 | 3 | |
| 1 | 2 | 12.5% |
| 3 | 2 | 12.5% |
| 2 | 2 | 12.5% |
| 8 | 1 | 6.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 6 | |
| - | 4 | |
| 0 | 3 | |
| 1 | 2 | 10.0% |
| 3 | 2 | 10.0% |
| 2 | 2 | 10.0% |
| 8 | 1 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 6 | |
| - | 4 | |
| 0 | 3 | |
| 1 | 2 | 10.0% |
| 3 | 2 | 10.0% |
| 2 | 2 | 10.0% |
| 8 | 1 | 5.0% |
preparations
Text
| Distinct | 527 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1860 |
| Missing (%) | 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 167 |
|---|---|
| Median length | 157 |
| Mean length | 10.12227257 |
| Min length | 3 |
Unique
| Unique | 212 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Alcohol (Ethanol) |
|---|---|
| 2nd row | Dry |
| 3rd row | Alcohol (Ethanol) |
| 4th row | Dry |
| 5th row | Dry |
| Value | Count | Frequency (%) |
| ethanol | 906962 | |
| dry | 902181 | |
| alcohol | 897467 | |
| slide | 129625 | 4.4% |
| 19547 | 0.7% | |
| 95 | 16839 | 0.6% |
| formalin | 12584 | 0.4% |
| biorepository | 12371 | 0.4% |
| isopropyl | 10052 | 0.3% |
| sorting | 6035 | 0.2% |
| Other values (40) | 31868 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2865932 | |
| o | 2796698 | |
| h | 1805994 | 9.3% |
| 1021330 | 5.2% | |
| r | 954159 | 4.9% |
| t | 939400 | 4.8% |
| n | 936694 | 4.8% |
| a | 925585 | 4.8% |
| y | 923820 | 4.7% |
| E | 912862 | 4.7% |
| Other values (43) | 5394813 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13610860 | |
| Uppercase Letter | 2924613 | 15.0% |
| Space Separator | 1021330 | 5.2% |
| Close Punctuation | 887415 | 4.6% |
| Open Punctuation | 887415 | 4.6% |
| Other Punctuation | 86944 | 0.4% |
| Decimal Number | 39163 | 0.2% |
| Dash Punctuation | 19547 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2865932 | |
| o | 2796698 | |
| h | 1805994 | |
| r | 954159 | 7.0% |
| t | 939400 | 6.9% |
| n | 936694 | 6.9% |
| a | 925585 | 6.8% |
| y | 923820 | 6.8% |
| c | 898488 | 6.6% |
| i | 181329 | 1.3% |
| Other values (12) | 382761 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 912862 | |
| D | 902455 | |
| A | 898567 | |
| S | 153297 | 5.2% |
| I | 13800 | 0.5% |
| F | 12983 | 0.4% |
| B | 12729 | 0.4% |
| M | 5938 | 0.2% |
| R | 4592 | 0.2% |
| Y | 4591 | 0.2% |
| Other values (9) | 2799 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 18431 | |
| 5 | 17781 | |
| 0 | 1802 | 4.6% |
| 8 | 1080 | 2.8% |
| 1 | 36 | 0.1% |
| 2 | 33 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 67396 | |
| % | 19548 | 22.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1021330 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 887415 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 887415 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19547 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16535473 | |
| Common | 2941814 | 15.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2865932 | |
| o | 2796698 | |
| h | 1805994 | |
| r | 954159 | 5.8% |
| t | 939400 | 5.7% |
| n | 936694 | 5.7% |
| a | 925585 | 5.6% |
| y | 923820 | 5.6% |
| E | 912862 | 5.5% |
| D | 902455 | 5.5% |
| Other values (31) | 2571874 |
Common
| Value | Count | Frequency (%) |
| 1021330 | ||
| ) | 887415 | |
| ( | 887415 | |
| ; | 67396 | 2.3% |
| % | 19548 | 0.7% |
| - | 19547 | 0.7% |
| 9 | 18431 | 0.6% |
| 5 | 17781 | 0.6% |
| 0 | 1802 | 0.1% |
| 8 | 1080 | < 0.1% |
| Other values (2) | 69 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19477287 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2865932 | |
| o | 2796698 | |
| h | 1805994 | 9.3% |
| 1021330 | 5.2% | |
| r | 954159 | 4.9% |
| t | 939400 | 4.8% |
| n | 936694 | 4.8% |
| a | 925585 | 4.8% |
| y | 923820 | 4.7% |
| E | 912862 | 4.7% |
| Other values (43) | 5394813 |
disposition
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 252 |
|---|---|
| 2nd row | 265 |
| Value | Count | Frequency (%) |
| 252 | 1 | |
| 265 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 5 | 2 | |
| 6 | 1 | 16.7% |
associatedMedia
Text
Missing 
| Distinct | 242386 |
|---|---|
| Distinct (%) | 95.5% |
| Missing | 1672204 |
| Missing (%) | 86.8% |
| Memory size | 14.7 MiB |
Length
| Max length | 1629 |
|---|---|
| Median length | 49 |
| Mean length | 50.86072868 |
| Min length | 3 |
Unique
| Unique | 241663 ? |
|---|---|
| Unique (%) | 95.2% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=12038700 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=16053651 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=18190 |
| 4th row | https://collections.nmnh.si.edu/media/?i=55934 |
| 5th row | https://collections.nmnh.si.edu/media/?i=10165617 |
| Value | Count | Frequency (%) |
| https://collections.nmnh.si.edu/media/?i=10674432 | 1623 | 0.5% |
| https://collections.nmnh.si.edu/media/?i=10689696 | 1456 | 0.4% |
| https://collections.nmnh.si.edu/media/?i=10696300 | 1243 | 0.4% |
| https://collections.nmnh.si.edu/media/?i=10684813 | 919 | 0.3% |
| https://collections.nmnh.si.edu/media/?i=10669453 | 853 | 0.3% |
| https://collections.nmnh.si.edu/media/?i=10643018 | 690 | 0.2% |
| https://collections.nmnh.si.edu/media/?i=10676407 | 540 | 0.2% |
| https://collections.nmnh.si.edu/media/?i=11455178 | 456 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=10865403 | 387 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=10803950 | 271 | 0.1% |
| Other values (311642) | 318373 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1015420 | 7.9% |
| / | 1015420 | 7.9% |
| t | 761565 | 5.9% |
| s | 761565 | 5.9% |
| . | 761565 | 5.9% |
| n | 761565 | 5.9% |
| e | 761565 | 5.9% |
| h | 507710 | 3.9% |
| d | 507710 | 3.9% |
| m | 507710 | 3.9% |
| Other values (21) | 5549557 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7869505 | |
| Other Punctuation | 2357649 | 18.3% |
| Decimal Number | 2357389 | 18.3% |
| Math Symbol | 253855 | 2.0% |
| Space Separator | 72954 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1015420 | |
| t | 761565 | |
| s | 761565 | |
| n | 761565 | |
| e | 761565 | |
| h | 507710 | 6.5% |
| d | 507710 | 6.5% |
| m | 507710 | 6.5% |
| l | 507710 | 6.5% |
| o | 507710 | 6.5% |
| Other values (4) | 1269275 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 430632 | |
| 5 | 257203 | |
| 4 | 232520 | |
| 6 | 226636 | |
| 0 | 219805 | |
| 8 | 208267 | |
| 3 | 206069 | |
| 2 | 202946 | |
| 7 | 194481 | |
| 9 | 178830 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1015420 | |
| . | 761565 | |
| ? | 253855 | 10.8% |
| : | 253855 | 10.8% |
| ; | 72954 | 3.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 253855 |
Space Separator
| Value | Count | Frequency (%) |
| 72954 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7869505 | |
| Common | 5041847 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 1015420 | |
| . | 761565 | |
| 1 | 430632 | 8.5% |
| 5 | 257203 | 5.1% |
| ? | 253855 | 5.0% |
| = | 253855 | 5.0% |
| : | 253855 | 5.0% |
| 4 | 232520 | 4.6% |
| 6 | 226636 | 4.5% |
| 0 | 219805 | 4.4% |
| Other values (7) | 1136501 |
Latin
| Value | Count | Frequency (%) |
| i | 1015420 | |
| t | 761565 | |
| s | 761565 | |
| n | 761565 | |
| e | 761565 | |
| h | 507710 | 6.5% |
| d | 507710 | 6.5% |
| m | 507710 | 6.5% |
| l | 507710 | 6.5% |
| o | 507710 | 6.5% |
| Other values (4) | 1269275 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12911352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1015420 | 7.9% |
| / | 1015420 | 7.9% |
| t | 761565 | 5.9% |
| s | 761565 | 5.9% |
| . | 761565 | 5.9% |
| n | 761565 | 5.9% |
| e | 761565 | 5.9% |
| h | 507710 | 3.9% |
| d | 507710 | 3.9% |
| m | 507710 | 3.9% |
| Other values (21) | 5549557 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1993 |
|---|---|
| 2nd row | 1938 |
| Value | Count | Frequency (%) |
| 1993 | 1 | |
| 1938 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 1 | 2 | |
| 3 | 2 | |
| 8 | 1 | 12.5% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 9 |
| Value | Count | Frequency (%) |
| 9 | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 2 |
Missing 
| Distinct | 5099 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 1920937 |
| Missing (%) | 99.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 1349 |
|---|---|
| Median length | 49 |
| Mean length | 85.49824356 |
| Min length | 1 |
Unique
| Unique | 5084 ? |
|---|---|
| Unique (%) | 99.2% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=AY426351|https://www.ncbi.nlm.nih.gov/gquery?term=AY379442|https://www.ncbi.nlm.nih.gov/gquery?term=AY426385 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=MH825989 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=MT223244 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=MH826372 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=KT792656 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=km521547 | 12 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ay643524 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ef060028|https://www.ncbi.nlm.nih.gov/gquery?term=kx362271 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=fj172481 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kx832080 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=srr9613700 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jq307001 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ku285912 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu863366|https://www.ncbi.nlm.nih.gov/gquery?term=eu863300 | 2 | < 0.1% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mh244118 | 2 | < 0.1% |
| Other values (5089) | 5094 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 35419 | 8.1% |
| t | 26562 | 6.1% |
| / | 26562 | 6.1% |
| w | 26562 | 6.1% |
| n | 26562 | 6.1% |
| h | 17708 | 4.0% |
| r | 17708 | 4.0% |
| i | 17708 | 4.0% |
| e | 17708 | 4.0% |
| m | 17708 | 4.0% |
| Other values (51) | 207886 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 274474 | |
| Other Punctuation | 79689 | 18.2% |
| Decimal Number | 53458 | 12.2% |
| Uppercase Letter | 17884 | 4.1% |
| Math Symbol | 12586 | 2.9% |
| Dash Punctuation | 1 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 3906 | |
| M | 3764 | |
| W | 1587 | |
| U | 1539 | 8.6% |
| F | 833 | 4.7% |
| J | 772 | 4.3% |
| X | 719 | 4.0% |
| C | 697 | 3.9% |
| T | 538 | 3.0% |
| H | 533 | 3.0% |
| Other values (14) | 2996 |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 26562 | 9.7% |
| w | 26562 | 9.7% |
| n | 26562 | 9.7% |
| h | 17708 | 6.5% |
| r | 17708 | 6.5% |
| i | 17708 | 6.5% |
| e | 17708 | 6.5% |
| m | 17708 | 6.5% |
| g | 17708 | 6.5% |
| q | 8854 | 3.2% |
| Other values (9) | 79686 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 7336 | |
| 8 | 6190 | |
| 0 | 5590 | |
| 4 | 5209 | |
| 6 | 5207 | |
| 5 | 5041 | |
| 3 | 4920 | |
| 9 | 4838 | |
| 1 | 4744 | |
| 7 | 4383 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 35419 | |
| / | 26562 | |
| ? | 8854 | 11.1% |
| : | 8854 | 11.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 8854 | |
| | | 3732 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 292358 | |
| Common | 145735 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 26562 | 9.1% |
| w | 26562 | 9.1% |
| n | 26562 | 9.1% |
| h | 17708 | 6.1% |
| r | 17708 | 6.1% |
| i | 17708 | 6.1% |
| e | 17708 | 6.1% |
| m | 17708 | 6.1% |
| g | 17708 | 6.1% |
| q | 8854 | 3.0% |
| Other values (33) | 97570 |
Common
| Value | Count | Frequency (%) |
| . | 35419 | |
| / | 26562 | |
| = | 8854 | 6.1% |
| ? | 8854 | 6.1% |
| : | 8854 | 6.1% |
| 2 | 7336 | 5.0% |
| 8 | 6190 | 4.2% |
| 0 | 5590 | 3.8% |
| 4 | 5209 | 3.6% |
| 6 | 5207 | 3.6% |
| Other values (8) | 27660 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 438093 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 35419 | 8.1% |
| t | 26562 | 6.1% |
| / | 26562 | 6.1% |
| w | 26562 | 6.1% |
| n | 26562 | 6.1% |
| h | 17708 | 4.0% |
| r | 17708 | 4.0% |
| i | 17708 | 4.0% |
| e | 17708 | 4.0% |
| m | 17708 | 4.0% |
| Other values (51) | 207886 |
Missing 
| Distinct | 384844 |
|---|---|
| Distinct (%) | 49.2% |
| Missing | 1144278 |
| Missing (%) | 59.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 29813 |
|---|---|
| Median length | 1371 |
| Mean length | 61.44131172 |
| Min length | 1 |
Unique
| Unique | 322638 ? |
|---|---|
| Unique (%) | 41.3% |
Sample
| 1st row | Jewett.; Stearns. |
|---|---|
| 2nd row | Bartsch |
| 3rd row | 15 Nov. 1973; Jones, Dawson, del Rosario; Fitzgerald; NMNH-STRI Survey |
| 4th row | U. S. B. Fish |
| 5th row | C.R. Laws |
| Value | Count | Frequency (%) |
| coll | 143172 | 2.1% |
| of | 115241 | 1.7% |
| and | 111346 | 1.7% |
| a | 107275 | 1.6% |
| by | 89596 | 1.3% |
| 87789 | 1.3% | |
| 2 | 65611 | 1.0% |
| 3 | 63122 | 0.9% |
| was | 62148 | 0.9% |
| formalin | 58887 | 0.9% |
| Other values (237636) | 5772371 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5890993 | 12.3% | |
| e | 2965089 | 6.2% |
| o | 2600658 | 5.4% |
| a | 2412401 | 5.0% |
| i | 2008687 | 4.2% |
| t | 1976952 | 4.1% |
| n | 1974652 | 4.1% |
| r | 1876623 | 3.9% |
| s | 1857069 | 3.9% |
| l | 1811612 | 3.8% |
| Other values (123) | 22659037 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27315643 | |
| Space Separator | 5890993 | 12.3% |
| Uppercase Letter | 5675734 | 11.8% |
| Other Punctuation | 4998736 | 10.4% |
| Decimal Number | 3433710 | 7.1% |
| Dash Punctuation | 298658 | 0.6% |
| Open Punctuation | 185583 | 0.4% |
| Close Punctuation | 185432 | 0.4% |
| Control | 21090 | < 0.1% |
| Math Symbol | 15137 | < 0.1% |
| Other values (8) | 13057 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2965089 | |
| o | 2600658 | 9.5% |
| a | 2412401 | 8.8% |
| i | 2008687 | 7.4% |
| t | 1976952 | 7.2% |
| n | 1974652 | 7.2% |
| r | 1876623 | 6.9% |
| s | 1857069 | 6.8% |
| l | 1811612 | 6.6% |
| d | 1161297 | 4.3% |
| Other values (32) | 6670603 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 695582 | 12.3% |
| S | 674934 | 11.9% |
| B | 358975 | 6.3% |
| F | 347135 | 6.1% |
| P | 325526 | 5.7% |
| N | 311183 | 5.5% |
| M | 289244 | 5.1% |
| A | 261990 | 4.6% |
| R | 238530 | 4.2% |
| H | 231655 | 4.1% |
| Other values (17) | 1940980 |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 1192398 | |
| . | 1191661 | |
| ; | 1043726 | |
| , | 582173 | |
| : | 567803 | |
| % | 166876 | 3.3% |
| / | 97178 | 1.9% |
| ! | 65383 | 1.3% |
| ' | 33797 | 0.7% |
| # | 25846 | 0.5% |
| Other values (6) | 31895 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 684528 | |
| 2 | 446207 | |
| 9 | 386796 | |
| 0 | 370228 | |
| 3 | 301861 | |
| 7 | 286303 | |
| 5 | 255519 | 7.4% |
| 6 | 251045 | 7.3% |
| 4 | 238239 | 6.9% |
| 8 | 212984 | 6.2% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 11234 | |
| = | 2004 | 13.2% |
| | | 1638 | 10.8% |
| > | 140 | 0.9% |
| ~ | 94 | 0.6% |
| < | 23 | 0.2% |
| ± | 2 | < 0.1% |
| × | 2 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3563 | |
| ♂ | 91 | 2.5% |
| ♀ | 49 | 1.3% |
| ⚥ | 6 | 0.2% |
| © | 2 | 0.1% |
| ® | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 298188 | |
| – | 469 | 0.2% |
| — | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 95133 | |
| { | 87809 | |
| [ | 2641 | 1.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 95004 | |
| } | 87803 | |
| ] | 2625 | 1.4% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 | |
| ¼ | 1 | |
| ³ | 1 |
Control
| Value | Count | Frequency (%) |
| 20979 | ||
| 111 | 0.5% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 383 | |
| € | 2 | 0.5% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 213 | |
| » | 1 | 0.5% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 213 | |
| « | 1 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 5890993 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7399 |
Other Letter
| Value | Count | Frequency (%) |
| º | 1128 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32992469 | |
| Common | 15041296 | |
| Greek | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2965089 | 9.0% |
| o | 2600658 | 7.9% |
| a | 2412401 | 7.3% |
| i | 2008687 | 6.1% |
| t | 1976952 | 6.0% |
| n | 1974652 | 6.0% |
| r | 1876623 | 5.7% |
| s | 1857069 | 5.6% |
| l | 1811612 | 5.5% |
| d | 1161297 | 3.5% |
| Other values (57) | 12347429 |
Common
| Value | Count | Frequency (%) |
| 5890993 | ||
| " | 1192398 | 7.9% |
| . | 1191661 | 7.9% |
| ; | 1043726 | 6.9% |
| 1 | 684528 | 4.6% |
| , | 582173 | 3.9% |
| : | 567803 | 3.8% |
| 2 | 446207 | 3.0% |
| 9 | 386796 | 2.6% |
| 0 | 370228 | 2.5% |
| Other values (54) | 2684783 |
Greek
| Value | Count | Frequency (%) |
| μ | 7 | |
| π | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48027274 | |
| None | 5313 | < 0.1% |
| Punctuation | 1038 | < 0.1% |
| Misc Symbols | 146 | < 0.1% |
| Currency Symbols | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5890993 | 12.3% | |
| e | 2965089 | 6.2% |
| o | 2600658 | 5.4% |
| a | 2412401 | 5.0% |
| i | 2008687 | 4.2% |
| t | 1976952 | 4.1% |
| n | 1974652 | 4.1% |
| r | 1876623 | 3.9% |
| s | 1857069 | 3.9% |
| l | 1811612 | 3.8% |
| Other values (86) | 22652538 |
None
| Value | Count | Frequency (%) |
| ° | 3563 | |
| º | 1128 | 21.2% |
| é | 384 | 7.2% |
| ü | 87 | 1.6% |
| µ | 28 | 0.5% |
| ö | 28 | 0.5% |
| ã | 14 | 0.3% |
| à | 12 | 0.2% |
| ó | 11 | 0.2% |
| á | 9 | 0.2% |
| Other values (18) | 49 | 0.9% |
Punctuation
| Value | Count | Frequency (%) |
| – | 469 | |
| ” | 213 | |
| “ | 213 | |
| … | 142 | 13.7% |
| — | 1 | 0.1% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 91 | |
| ♀ | 49 | |
| ⚥ | 6 | 4.1% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 2 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 48.5 |
| Mean length | 48.5 |
| Min length | 35 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America, North Pacific Ocean, Gulf Of California, Mexico |
|---|---|
| 2nd row | North America, United States, Texas |
| Value | Count | Frequency (%) |
| north | 3 | |
| america | 2 | |
| pacific | 1 | 7.1% |
| ocean | 1 | 7.1% |
| gulf | 1 | 7.1% |
| of | 1 | 7.1% |
| california | 1 | 7.1% |
| mexico | 1 | 7.1% |
| united | 1 | 7.1% |
| states | 1 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 12 | 12.4% | |
| a | 8 | 8.2% |
| i | 8 | 8.2% |
| e | 7 | 7.2% |
| r | 6 | 6.2% |
| t | 6 | 6.2% |
| c | 6 | 6.2% |
| o | 5 | 5.2% |
| , | 5 | 5.2% |
| f | 4 | 4.1% |
| Other values (18) | 30 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 66 | |
| Uppercase Letter | 14 | 14.4% |
| Space Separator | 12 | 12.4% |
| Other Punctuation | 5 | 5.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 8 | |
| e | 7 | |
| r | 6 | |
| t | 6 | |
| c | 6 | |
| o | 5 | |
| f | 4 | 6.1% |
| n | 3 | 4.5% |
| h | 3 | 4.5% |
| Other values (6) | 10 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3 | |
| A | 2 | |
| O | 2 | |
| P | 1 | 7.1% |
| G | 1 | 7.1% |
| C | 1 | 7.1% |
| M | 1 | 7.1% |
| U | 1 | 7.1% |
| S | 1 | 7.1% |
| T | 1 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| 12 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 80 | |
| Common | 17 | 17.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | 10.0% |
| i | 8 | 10.0% |
| e | 7 | 8.8% |
| r | 6 | 7.5% |
| t | 6 | 7.5% |
| c | 6 | 7.5% |
| o | 5 | 6.2% |
| f | 4 | 5.0% |
| n | 3 | 3.8% |
| N | 3 | 3.8% |
| Other values (16) | 24 |
Common
| Value | Count | Frequency (%) |
| 12 | ||
| , | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 12 | 12.4% | |
| a | 8 | 8.2% |
| i | 8 | 8.2% |
| e | 7 | 7.2% |
| r | 6 | 6.2% |
| t | 6 | 6.2% |
| c | 6 | 6.2% |
| o | 5 | 5.2% |
| , | 5 | 5.2% |
| f | 4 | 4.1% |
| Other values (18) | 30 |
verbatimLabel
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 23.5 |
| Mean length | 23.5 |
| Min length | 13 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America, North Pacific Ocean |
|---|---|
| 2nd row | North America |
| Value | Count | Frequency (%) |
| north | 3 | |
| america | 2 | |
| pacific | 1 | 14.3% |
| ocean | 1 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 5 | |
| c | 5 | |
| 5 | ||
| a | 4 | |
| i | 4 | |
| N | 3 | 6.4% |
| o | 3 | 6.4% |
| e | 3 | 6.4% |
| h | 3 | 6.4% |
| t | 3 | 6.4% |
| Other values (7) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34 | |
| Uppercase Letter | 7 | 14.9% |
| Space Separator | 5 | 10.6% |
| Other Punctuation | 1 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 5 | |
| c | 5 | |
| a | 4 | |
| i | 4 | |
| o | 3 | |
| e | 3 | |
| h | 3 | |
| t | 3 | |
| m | 2 | 5.9% |
| f | 1 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3 | |
| A | 2 | |
| P | 1 | 14.3% |
| O | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41 | |
| Common | 6 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 5 | |
| c | 5 | |
| a | 4 | |
| i | 4 | |
| N | 3 | |
| o | 3 | |
| e | 3 | |
| h | 3 | |
| t | 3 | |
| m | 2 | 4.9% |
| Other values (5) | 6 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| , | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 5 | |
| c | 5 | |
| 5 | ||
| a | 4 | |
| i | 4 | |
| N | 3 | 6.4% |
| o | 3 | 6.4% |
| e | 3 | 6.4% |
| h | 3 | 6.4% |
| t | 3 | 6.4% |
| Other values (7) | 9 |
materialSampleID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 39 |
|---|---|
| Median length | 39 |
| Mean length | 39 |
| Min length | 39 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North Pacific Ocean, Gulf Of California |
|---|
| Value | Count | Frequency (%) |
| north | 1 | |
| pacific | 1 | |
| ocean | 1 | |
| gulf | 1 | |
| of | 1 | |
| california | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | ||
| i | 4 | |
| a | 4 | |
| f | 4 | |
| c | 3 | 7.7% |
| n | 2 | 5.1% |
| r | 2 | 5.1% |
| l | 2 | 5.1% |
| o | 2 | 5.1% |
| O | 2 | 5.1% |
| Other values (9) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27 | |
| Uppercase Letter | 6 | 15.4% |
| Space Separator | 5 | 12.8% |
| Other Punctuation | 1 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| f | 4 | |
| c | 3 | |
| n | 2 | |
| r | 2 | |
| l | 2 | |
| o | 2 | |
| u | 1 | 3.7% |
| e | 1 | 3.7% |
| Other values (2) | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2 | |
| G | 1 | |
| N | 1 | |
| P | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33 | |
| Common | 6 | 15.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| a | 4 | |
| f | 4 | |
| c | 3 | |
| n | 2 | 6.1% |
| r | 2 | 6.1% |
| l | 2 | 6.1% |
| o | 2 | 6.1% |
| O | 2 | 6.1% |
| u | 1 | 3.0% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| , | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | ||
| i | 4 | |
| a | 4 | |
| f | 4 | |
| c | 3 | 7.7% |
| n | 2 | 5.1% |
| r | 2 | 5.1% |
| l | 2 | 5.1% |
| o | 2 | 5.1% |
| O | 2 | 5.1% |
| Other values (9) | 9 |
eventType
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9.5 |
| Mean length | 9.5 |
| Min length | 6 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Mexico |
|---|---|
| 2nd row | United States |
| Value | Count | Frequency (%) |
| mexico | 1 | |
| united | 1 | |
| states | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| M | 1 | 5.3% |
| x | 1 | 5.3% |
| c | 1 | 5.3% |
| o | 1 | 5.3% |
| U | 1 | 5.3% |
| n | 1 | 5.3% |
| d | 1 | 5.3% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Uppercase Letter | 3 | 15.8% |
| Space Separator | 1 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| x | 1 | 6.7% |
| c | 1 | 6.7% |
| o | 1 | 6.7% |
| n | 1 | 6.7% |
| d | 1 | 6.7% |
| a | 1 | 6.7% |
| s | 1 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| U | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18 | |
| Common | 1 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| M | 1 | 5.6% |
| x | 1 | 5.6% |
| c | 1 | 5.6% |
| o | 1 | 5.6% |
| U | 1 | 5.6% |
| n | 1 | 5.6% |
| d | 1 | 5.6% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| M | 1 | 5.3% |
| x | 1 | 5.3% |
| c | 1 | 5.3% |
| o | 1 | 5.3% |
| U | 1 | 5.3% |
| n | 1 | 5.3% |
| d | 1 | 5.3% |
| Other values (4) | 4 |
fieldNumber
Text
Missing 
| Distinct | 62645 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 1339537 |
| Missing (%) | 69.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 111 |
|---|---|
| Median length | 63 |
| Mean length | 13.61587079 |
| Min length | 1 |
Unique
| Unique | 27485 ? |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | MMS-CABP/02B-E4 |
|---|---|
| 2nd row | 4/III-23-TDS |
| 3rd row | USARP/EL/12/1002/USC |
| 4th row | USFC/A2059 |
| 5th row | USFC/A5374 |
| Value | Count | Frequency (%) |
| mms-mafla/jar | 17287 | 2.6% |
| bolland/rfb | 7604 | 1.1% |
| humes | 5242 | 0.8% |
| jpem | 5029 | 0.8% |
| 4975 | 0.8% | |
| rh | 2306 | 0.3% |
| k-rh | 1556 | 0.2% |
| spm | 1163 | 0.2% |
| mnhn-norfolk | 1131 | 0.2% |
| haul | 1039 | 0.2% |
| Other values (59081) | 614323 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 742625 | 9.3% |
| S | 650584 | 8.1% |
| M | 501285 | 6.3% |
| - | 479984 | 6.0% |
| A | 421779 | 5.3% |
| 1 | 403168 | 5.0% |
| 0 | 377764 | 4.7% |
| C | 368103 | 4.6% |
| 2 | 360900 | 4.5% |
| U | 266483 | 3.3% |
| Other values (72) | 3413360 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3901644 | |
| Decimal Number | 2536481 | |
| Other Punctuation | 835531 | 10.5% |
| Dash Punctuation | 479984 | 6.0% |
| Lowercase Letter | 145874 | 1.8% |
| Space Separator | 75131 | 0.9% |
| Connector Punctuation | 7570 | 0.1% |
| Open Punctuation | 1756 | < 0.1% |
| Close Punctuation | 1756 | < 0.1% |
| Math Symbol | 302 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 650584 | |
| M | 501285 | |
| A | 421779 | |
| C | 368103 | |
| U | 266483 | 6.8% |
| F | 236148 | 6.1% |
| I | 186834 | 4.8% |
| R | 170590 | 4.4% |
| L | 169956 | 4.4% |
| P | 165581 | 4.2% |
| Other values (16) | 764301 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 25302 | |
| r | 24949 | |
| a | 23100 | |
| l | 9448 | 6.5% |
| s | 8103 | 5.6% |
| i | 7885 | 5.4% |
| o | 7864 | 5.4% |
| u | 7557 | 5.2% |
| m | 5785 | 4.0% |
| t | 4696 | 3.2% |
| Other values (16) | 21185 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 742625 | |
| : | 80839 | 9.7% |
| . | 4233 | 0.5% |
| ; | 3671 | 0.4% |
| , | 2634 | 0.3% |
| # | 938 | 0.1% |
| \ | 340 | < 0.1% |
| ? | 150 | < 0.1% |
| & | 61 | < 0.1% |
| " | 16 | < 0.1% |
| Other values (2) | 24 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 403168 | |
| 0 | 377764 | |
| 2 | 360900 | |
| 5 | 260694 | |
| 3 | 252335 | |
| 4 | 217269 | |
| 7 | 192296 | |
| 6 | 178137 | |
| 8 | 164674 | |
| 9 | 129244 | 5.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 290 | |
| = | 12 | 4.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 479984 |
Space Separator
| Value | Count | Frequency (%) |
| 75131 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7570 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1756 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1756 |
Control
| Value | Count | Frequency (%) |
| | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4047518 | |
| Common | 3938517 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 650584 | |
| M | 501285 | |
| A | 421779 | |
| C | 368103 | |
| U | 266483 | 6.6% |
| F | 236148 | 5.8% |
| I | 186834 | 4.6% |
| R | 170590 | 4.2% |
| L | 169956 | 4.2% |
| P | 165581 | 4.1% |
| Other values (42) | 910175 |
Common
| Value | Count | Frequency (%) |
| / | 742625 | |
| - | 479984 | |
| 1 | 403168 | |
| 0 | 377764 | |
| 2 | 360900 | |
| 5 | 260694 | 6.6% |
| 3 | 252335 | 6.4% |
| 4 | 217269 | 5.5% |
| 7 | 192296 | 4.9% |
| 6 | 178137 | 4.5% |
| Other values (20) | 473345 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7986035 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 742625 | 9.3% |
| S | 650584 | 8.1% |
| M | 501285 | 6.3% |
| - | 479984 | 6.0% |
| A | 421779 | 5.3% |
| 1 | 403168 | 5.0% |
| 0 | 377764 | 4.7% |
| C | 368103 | 4.6% |
| 2 | 360900 | 4.5% |
| U | 266483 | 3.3% |
| Other values (72) | 3413360 |
eventDate
Text
Missing 
| Distinct | 46451 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 684431 |
| Missing (%) | 35.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 10 |
| Mean length | 9.847445696 |
| Min length | 4 |
Unique
| Unique | 7284 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 1976-03-03 |
|---|---|
| 2nd row | 1984-05-15 |
| 3rd row | 1964-03-15 |
| 4th row | 1883-08-31 |
| 5th row | 1909-03-02 |
| Value | Count | Frequency (%) |
| 1915 | 6240 | 0.5% |
| 1982-07-21 | 5683 | 0.5% |
| 1981-07-06 | 5412 | 0.4% |
| 1983-05-13 | 5155 | 0.4% |
| 1982-11-19 | 5037 | 0.4% |
| 1982-02-10 | 4461 | 0.4% |
| 1981-11-09 | 4296 | 0.3% |
| 1913 | 4289 | 0.3% |
| 1982-05-10 | 4268 | 0.3% |
| 1977-01-28/1977-02-13 | 3795 | 0.3% |
| Other values (46407) | 1193140 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2356589 | |
| - | 2334518 | |
| 0 | 1811279 | |
| 9 | 1510678 | |
| 2 | 833024 | 6.8% |
| 8 | 784802 | 6.4% |
| 7 | 719423 | 5.9% |
| 6 | 566996 | 4.6% |
| 5 | 438603 | 3.6% |
| 3 | 432985 | 3.5% |
| Other values (11) | 437987 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9840824 | |
| Dash Punctuation | 2334518 | 19.1% |
| Other Punctuation | 51245 | 0.4% |
| Lowercase Letter | 150 | < 0.1% |
| Space Separator | 146 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2356589 | |
| 0 | 1811279 | |
| 9 | 1510678 | |
| 2 | 833024 | 8.5% |
| 8 | 784802 | 8.0% |
| 7 | 719423 | 7.3% |
| 6 | 566996 | 5.8% |
| 5 | 438603 | 4.5% |
| 3 | 432985 | 4.4% |
| 4 | 386445 | 3.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 73 | |
| r | 73 | |
| e | 1 | 0.7% |
| x | 1 | 0.7% |
| a | 1 | 0.7% |
| s | 1 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 50988 | |
| , | 257 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2334518 |
Space Separator
| Value | Count | Frequency (%) |
| 146 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12226733 | |
| Latin | 151 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2356589 | |
| - | 2334518 | |
| 0 | 1811279 | |
| 9 | 1510678 | |
| 2 | 833024 | 6.8% |
| 8 | 784802 | 6.4% |
| 7 | 719423 | 5.9% |
| 6 | 566996 | 4.6% |
| 5 | 438603 | 3.6% |
| 3 | 432985 | 3.5% |
| Other values (4) | 437836 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| o | 73 | |
| r | 73 | |
| T | 1 | 0.7% |
| e | 1 | 0.7% |
| x | 1 | 0.7% |
| a | 1 | 0.7% |
| s | 1 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12226884 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2356589 | |
| - | 2334518 | |
| 0 | 1811279 | |
| 9 | 1510678 | |
| 2 | 833024 | 6.8% |
| 8 | 784802 | 6.4% |
| 7 | 719423 | 5.9% |
| 6 | 566996 | 4.6% |
| 5 | 438603 | 3.6% |
| 3 | 432985 | 3.5% |
| Other values (11) | 437987 | 3.6% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 772926 |
| Missing (%) | 40.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.745095761 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 63 |
|---|---|
| 2nd row | 136 |
| 3rd row | 75 |
| 4th row | 243 |
| 5th row | 61 |
| Value | Count | Frequency (%) |
| 243 | 12547 | 1.1% |
| 334 | 10327 | 0.9% |
| 151 | 9378 | 0.8% |
| 202 | 9211 | 0.8% |
| 133 | 9049 | 0.8% |
| 212 | 8665 | 0.8% |
| 187 | 8345 | 0.7% |
| 130 | 7951 | 0.7% |
| 323 | 7924 | 0.7% |
| 41 | 7863 | 0.7% |
| Other values (356) | 1061875 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 622454 | |
| 2 | 590934 | |
| 3 | 457326 | |
| 4 | 256897 | |
| 5 | 233467 | 7.4% |
| 0 | 218600 | 6.9% |
| 6 | 207670 | 6.6% |
| 9 | 203223 | 6.4% |
| 7 | 193100 | 6.1% |
| 8 | 181795 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3165466 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 622454 | |
| 2 | 590934 | |
| 3 | 457326 | |
| 4 | 256897 | |
| 5 | 233467 | 7.4% |
| 0 | 218600 | 6.9% |
| 6 | 207670 | 6.6% |
| 9 | 203223 | 6.4% |
| 7 | 193100 | 6.1% |
| 8 | 181795 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3165466 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 622454 | |
| 2 | 590934 | |
| 3 | 457326 | |
| 4 | 256897 | |
| 5 | 233467 | 7.4% |
| 0 | 218600 | 6.9% |
| 6 | 207670 | 6.6% |
| 9 | 203223 | 6.4% |
| 7 | 193100 | 6.1% |
| 8 | 181795 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3165466 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 622454 | |
| 2 | 590934 | |
| 3 | 457326 | |
| 4 | 256897 | |
| 5 | 233467 | 7.4% |
| 0 | 218600 | 6.9% |
| 6 | 207670 | 6.6% |
| 9 | 203223 | 6.4% |
| 7 | 193100 | 6.1% |
| 8 | 181795 | 5.7% |
endDayOfYear
Text
Missing 
| Distinct | 368 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 773095 |
| Missing (%) | 40.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 3 |
| Mean length | 2.745963021 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 63 |
|---|---|
| 2nd row | 136 |
| 3rd row | 75 |
| 4th row | 243 |
| 5th row | 61 |
| Value | Count | Frequency (%) |
| 243 | 12439 | 1.1% |
| 334 | 10162 | 0.9% |
| 151 | 9376 | 0.8% |
| 202 | 9186 | 0.8% |
| 133 | 9037 | 0.8% |
| 212 | 8808 | 0.8% |
| 187 | 8348 | 0.7% |
| 41 | 7969 | 0.7% |
| 323 | 7922 | 0.7% |
| 130 | 7868 | 0.7% |
| Other values (360) | 1061853 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 626592 | |
| 2 | 588319 | |
| 3 | 458636 | |
| 4 | 260045 | |
| 5 | 235565 | 7.4% |
| 0 | 220401 | 7.0% |
| 6 | 202442 | 6.4% |
| 9 | 198660 | 6.3% |
| 7 | 192143 | 6.1% |
| 8 | 183183 | 5.8% |
| Other values (10) | 16 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3165986 | |
| Lowercase Letter | 10 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 626592 | |
| 2 | 588319 | |
| 3 | 458636 | |
| 4 | 260045 | |
| 5 | 235565 | 7.4% |
| 0 | 220401 | 7.0% |
| 6 | 202442 | 6.4% |
| 9 | 198660 | 6.3% |
| 7 | 192143 | 6.1% |
| 8 | 183183 | 5.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 2 | |
| z | 1 | 10.0% |
| g | 1 | 10.0% |
| l | 1 | 10.0% |
| k | 1 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 2 | |
| P | 1 | |
| E | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3165988 | |
| Latin | 14 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 626592 | |
| 2 | 588319 | |
| 3 | 458636 | |
| 4 | 260045 | |
| 5 | 235565 | 7.4% |
| 0 | 220401 | 7.0% |
| 6 | 202442 | 6.4% |
| 9 | 198660 | 6.3% |
| 7 | 192143 | 6.1% |
| 8 | 183183 | 5.8% |
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| e | 2 | |
| L | 2 | |
| P | 1 | 7.1% |
| z | 1 | 7.1% |
| E | 1 | 7.1% |
| g | 1 | 7.1% |
| l | 1 | 7.1% |
| k | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3166002 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 626592 | |
| 2 | 588319 | |
| 3 | 458636 | |
| 4 | 260045 | |
| 5 | 235565 | 7.4% |
| 0 | 220401 | 7.0% |
| 6 | 202442 | 6.4% |
| 9 | 198660 | 6.3% |
| 7 | 192143 | 6.1% |
| 8 | 183183 | 5.8% |
| Other values (10) | 16 | < 0.1% |
year
Text
Missing 
| Distinct | 208 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 684432 |
| Missing (%) | 35.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1976 |
|---|---|
| 2nd row | 1984 |
| 3rd row | 1964 |
| 4th row | 1883 |
| 5th row | 1909 |
| Value | Count | Frequency (%) |
| 1977 | 73833 | 5.9% |
| 1981 | 43888 | 3.5% |
| 1976 | 42215 | 3.4% |
| 1982 | 38215 | 3.1% |
| 1984 | 38199 | 3.1% |
| 1908 | 35404 | 2.9% |
| 1983 | 34028 | 2.7% |
| 1985 | 30489 | 2.5% |
| 1964 | 28252 | 2.3% |
| 1975 | 25217 | 2.0% |
| Other values (198) | 851889 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1365971 | |
| 9 | 1248710 | |
| 8 | 526126 | 10.6% |
| 7 | 429937 | 8.7% |
| 6 | 323883 | 6.5% |
| 0 | 306308 | 6.2% |
| 2 | 220325 | 4.4% |
| 5 | 194998 | 3.9% |
| 4 | 178212 | 3.6% |
| 3 | 172046 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4966516 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1365971 | |
| 9 | 1248710 | |
| 8 | 526126 | 10.6% |
| 7 | 429937 | 8.7% |
| 6 | 323883 | 6.5% |
| 0 | 306308 | 6.2% |
| 2 | 220325 | 4.4% |
| 5 | 194998 | 3.9% |
| 4 | 178212 | 3.6% |
| 3 | 172046 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4966516 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1365971 | |
| 9 | 1248710 | |
| 8 | 526126 | 10.6% |
| 7 | 429937 | 8.7% |
| 6 | 323883 | 6.5% |
| 0 | 306308 | 6.2% |
| 2 | 220325 | 4.4% |
| 5 | 194998 | 3.9% |
| 4 | 178212 | 3.6% |
| 3 | 172046 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4966516 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1365971 | |
| 9 | 1248710 | |
| 8 | 526126 | 10.6% |
| 7 | 429937 | 8.7% |
| 6 | 323883 | 6.5% |
| 0 | 306308 | 6.2% |
| 2 | 220325 | 4.4% |
| 5 | 194998 | 3.9% |
| 4 | 178212 | 3.6% |
| 3 | 172046 | 3.5% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 768070 |
| Missing (%) | 39.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.188973835 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 5 |
| 3rd row | 3 |
| 4th row | 8 |
| 5th row | 3 |
| Value | Count | Frequency (%) |
| 8 | 133419 | |
| 5 | 128908 | |
| 7 | 123523 | |
| 6 | 108501 | |
| 4 | 100253 | |
| 11 | 97478 | |
| 2 | 97354 | |
| 3 | 89640 | |
| 9 | 87608 | |
| 1 | 69955 | |
| Other values (2) | 121352 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 386263 | |
| 2 | 150267 | 10.9% |
| 8 | 133419 | 9.7% |
| 5 | 128908 | 9.4% |
| 7 | 123523 | 9.0% |
| 6 | 108501 | 7.9% |
| 4 | 100253 | 7.3% |
| 3 | 89640 | 6.5% |
| 9 | 87608 | 6.4% |
| 0 | 68439 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1376821 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 386263 | |
| 2 | 150267 | 10.9% |
| 8 | 133419 | 9.7% |
| 5 | 128908 | 9.4% |
| 7 | 123523 | 9.0% |
| 6 | 108501 | 7.9% |
| 4 | 100253 | 7.3% |
| 3 | 89640 | 6.5% |
| 9 | 87608 | 6.4% |
| 0 | 68439 | 5.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1376821 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 386263 | |
| 2 | 150267 | 10.9% |
| 8 | 133419 | 9.7% |
| 5 | 128908 | 9.4% |
| 7 | 123523 | 9.0% |
| 6 | 108501 | 7.9% |
| 4 | 100253 | 7.3% |
| 3 | 89640 | 6.5% |
| 9 | 87608 | 6.4% |
| 0 | 68439 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1376821 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 386263 | |
| 2 | 150267 | 10.9% |
| 8 | 133419 | 9.7% |
| 5 | 128908 | 9.4% |
| 7 | 123523 | 9.0% |
| 6 | 108501 | 7.9% |
| 4 | 100253 | 7.3% |
| 3 | 89640 | 6.5% |
| 9 | 87608 | 6.4% |
| 0 | 68439 | 5.0% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 841840 |
| Missing (%) | 43.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.705713134 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 15 |
| 3rd row | 15 |
| 4th row | 31 |
| 5th row | 2 |
| Value | Count | Frequency (%) |
| 10 | 46154 | 4.3% |
| 13 | 45658 | 4.2% |
| 19 | 44172 | 4.1% |
| 6 | 40659 | 3.8% |
| 21 | 40525 | 3.7% |
| 15 | 38548 | 3.6% |
| 8 | 38174 | 3.5% |
| 9 | 38061 | 3.5% |
| 18 | 36739 | 3.4% |
| 14 | 36106 | 3.3% |
| Other values (21) | 679425 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 508569 | |
| 2 | 433588 | |
| 3 | 161568 | 8.7% |
| 5 | 109545 | 5.9% |
| 9 | 109355 | 5.9% |
| 0 | 109327 | 5.9% |
| 8 | 109113 | 5.9% |
| 6 | 105547 | 5.7% |
| 4 | 102072 | 5.5% |
| 7 | 100686 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1849370 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 508569 | |
| 2 | 433588 | |
| 3 | 161568 | 8.7% |
| 5 | 109545 | 5.9% |
| 9 | 109355 | 5.9% |
| 0 | 109327 | 5.9% |
| 8 | 109113 | 5.9% |
| 6 | 105547 | 5.7% |
| 4 | 102072 | 5.5% |
| 7 | 100686 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1849370 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 508569 | |
| 2 | 433588 | |
| 3 | 161568 | 8.7% |
| 5 | 109545 | 5.9% |
| 9 | 109355 | 5.9% |
| 0 | 109327 | 5.9% |
| 8 | 109113 | 5.9% |
| 6 | 105547 | 5.7% |
| 4 | 102072 | 5.5% |
| 7 | 100686 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1849370 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 508569 | |
| 2 | 433588 | |
| 3 | 161568 | 8.7% |
| 5 | 109545 | 5.9% |
| 9 | 109355 | 5.9% |
| 0 | 109327 | 5.9% |
| 8 | 109113 | 5.9% |
| 6 | 105547 | 5.7% |
| 4 | 102072 | 5.5% |
| 7 | 100686 | 5.4% |
Missing 
| Distinct | 47773 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 1172997 |
| Missing (%) | 60.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 181 |
|---|---|
| Median length | 11 |
| Mean length | 11.01792942 |
| Min length | 1 |
Unique
| Unique | 15836 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | -- --- ---- |
|---|---|
| 2nd row | 15 MAY 1984 |
| 3rd row | 15 MAR 1964 |
| 4th row | 03 MAR 1967 |
| 5th row | 31 AUG 1958 |
| Value | Count | Frequency (%) |
| 275863 | 12.6% | |
| may | 68613 | 3.1% |
| aug | 65838 | 3.0% |
| jul | 61523 | 2.8% |
| apr | 57927 | 2.6% |
| feb | 53279 | 2.4% |
| jun | 52775 | 2.4% |
| nov | 52199 | 2.4% |
| mar | 46116 | 2.1% |
| 1977 | 42123 | 1.9% |
| Other values (8402) | 1418761 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1441953 | ||
| 1 | 1077360 | |
| 9 | 807768 | 9.7% |
| - | 749476 | 9.0% |
| 2 | 340220 | 4.1% |
| 7 | 334219 | 4.0% |
| 0 | 322795 | 3.9% |
| 8 | 301908 | 3.6% |
| 6 | 296029 | 3.6% |
| A | 274073 | 3.3% |
| Other values (71) | 2351405 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4020395 | |
| Uppercase Letter | 1821264 | |
| Space Separator | 1441953 | 17.4% |
| Dash Punctuation | 749476 | 9.0% |
| Lowercase Letter | 202093 | 2.4% |
| Other Punctuation | 58036 | 0.7% |
| Close Punctuation | 1858 | < 0.1% |
| Open Punctuation | 1855 | < 0.1% |
| Connector Punctuation | 187 | < 0.1% |
| Math Symbol | 89 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 23080 | |
| r | 22925 | |
| l | 19197 | |
| n | 18908 | |
| i | 17650 | |
| a | 15940 | |
| t | 14268 | |
| p | 13296 | 6.6% |
| g | 11951 | 5.9% |
| u | 11197 | 5.5% |
| Other values (15) | 33681 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 274073 | |
| U | 175994 | 9.7% |
| J | 155768 | 8.6% |
| N | 143274 | 7.9% |
| M | 120553 | 6.6% |
| E | 116114 | 6.4% |
| R | 101232 | 5.6% |
| P | 93207 | 5.1% |
| O | 88359 | 4.9% |
| Y | 68066 | 3.7% |
| Other values (14) | 484624 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19785 | |
| / | 15745 | |
| , | 11251 | |
| : | 9405 | |
| ; | 983 | 1.7% |
| ? | 319 | 0.5% |
| & | 294 | 0.5% |
| ' | 244 | 0.4% |
| " | 5 | < 0.1% |
| \ | 2 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1077360 | |
| 9 | 807768 | |
| 2 | 340220 | 8.5% |
| 7 | 334219 | 8.3% |
| 0 | 322795 | 8.0% |
| 8 | 301908 | 7.5% |
| 6 | 296029 | 7.4% |
| 3 | 203527 | 5.1% |
| 5 | 177538 | 4.4% |
| 4 | 159031 | 4.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 80 | |
| ~ | 8 | 9.0% |
| < | 1 | 1.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1836 | |
| ] | 22 | 1.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1835 | |
| [ | 20 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1441953 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 749476 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 187 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6273849 | |
| Latin | 2023357 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 274073 | 13.5% |
| U | 175994 | 8.7% |
| J | 155768 | 7.7% |
| N | 143274 | 7.1% |
| M | 120553 | 6.0% |
| E | 116114 | 5.7% |
| R | 101232 | 5.0% |
| P | 93207 | 4.6% |
| O | 88359 | 4.4% |
| Y | 68066 | 3.4% |
| Other values (39) | 686717 |
Common
| Value | Count | Frequency (%) |
| 1441953 | ||
| 1 | 1077360 | |
| 9 | 807768 | |
| - | 749476 | |
| 2 | 340220 | 5.4% |
| 7 | 334219 | 5.3% |
| 0 | 322795 | 5.1% |
| 8 | 301908 | 4.8% |
| 6 | 296029 | 4.7% |
| 3 | 203527 | 3.2% |
| Other values (22) | 398594 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8297206 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1441953 | ||
| 1 | 1077360 | |
| 9 | 807768 | 9.7% |
| - | 749476 | 9.0% |
| 2 | 340220 | 4.1% |
| 7 | 334219 | 4.0% |
| 0 | 322795 | 3.9% |
| 8 | 301908 | 3.6% |
| 6 | 296029 | 3.6% |
| A | 274073 | 3.3% |
| Other values (71) | 2351405 |
habitat
Text
Missing 
| Distinct | 18959 |
|---|---|
| Distinct (%) | 27.4% |
| Missing | 1856817 |
| Missing (%) | 96.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 235 |
|---|---|
| Median length | 159 |
| Mean length | 19.79862515 |
| Min length | 1 |
Unique
| Unique | 13598 ? |
|---|---|
| Unique (%) | 19.6% |
Sample
| 1st row | Beach with fresh water creek running into it |
|---|---|
| 2nd row | Freshwater |
| 3rd row | In sand |
| 4th row | Mangrove |
| 5th row | Under rocks |
| Value | Count | Frequency (%) |
| freshwater | 9206 | 4.1% |
| in | 6886 | 3.1% |
| on | 6372 | 2.8% |
| reef | 6192 | 2.8% |
| sand | 6091 | 2.7% |
| coral | 5812 | 2.6% |
| of | 4886 | 2.2% |
| rocks | 4638 | 2.1% |
| sp | 4290 | 1.9% |
| intertidal | 4237 | 1.9% |
| Other values (6964) | 165771 |
Most occurring characters
| Value | Count | Frequency (%) |
| 155137 | 11.3% | |
| e | 134073 | 9.8% |
| a | 117948 | 8.6% |
| r | 101183 | 7.4% |
| n | 83040 | 6.1% |
| s | 82869 | 6.0% |
| o | 79792 | 5.8% |
| t | 71836 | 5.2% |
| i | 60744 | 4.4% |
| l | 60219 | 4.4% |
| Other values (79) | 424095 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1121590 | |
| Space Separator | 155137 | 11.3% |
| Uppercase Letter | 60784 | 4.4% |
| Other Punctuation | 20717 | 1.5% |
| Decimal Number | 6940 | 0.5% |
| Math Symbol | 2493 | 0.2% |
| Dash Punctuation | 1845 | 0.1% |
| Open Punctuation | 717 | 0.1% |
| Close Punctuation | 712 | 0.1% |
| Other Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 134073 | |
| a | 117948 | |
| r | 101183 | 9.0% |
| n | 83040 | 7.4% |
| s | 82869 | 7.4% |
| o | 79792 | 7.1% |
| t | 71836 | 6.4% |
| i | 60744 | 5.4% |
| l | 60219 | 5.4% |
| d | 54652 | 4.9% |
| Other values (18) | 275234 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 12117 | |
| L | 6196 | |
| S | 6180 | |
| I | 5574 | |
| R | 4363 | 7.2% |
| O | 3936 | 6.5% |
| M | 3425 | 5.6% |
| C | 3203 | 5.3% |
| U | 2435 | 4.0% |
| B | 2327 | 3.8% |
| Other values (16) | 11028 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10234 | |
| . | 7692 | |
| ; | 838 | 4.0% |
| / | 686 | 3.3% |
| ' | 442 | 2.1% |
| # | 299 | 1.4% |
| & | 196 | 0.9% |
| : | 111 | 0.5% |
| % | 90 | 0.4% |
| " | 69 | 0.3% |
| Other values (3) | 60 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1216 | |
| 0 | 1156 | |
| 2 | 887 | |
| 5 | 749 | |
| 3 | 666 | |
| 4 | 598 | |
| 6 | 522 | |
| 8 | 389 | 5.6% |
| 7 | 387 | 5.6% |
| 9 | 370 | 5.3% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 2456 | |
| = | 24 | 1.0% |
| < | 7 | 0.3% |
| ~ | 4 | 0.2% |
| > | 2 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 714 | |
| [ | 3 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 710 | |
| ] | 2 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 155137 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1845 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1182374 | |
| Common | 188562 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 134073 | 11.3% |
| a | 117948 | 10.0% |
| r | 101183 | 8.6% |
| n | 83040 | 7.0% |
| s | 82869 | 7.0% |
| o | 79792 | 6.7% |
| t | 71836 | 6.1% |
| i | 60744 | 5.1% |
| l | 60219 | 5.1% |
| d | 54652 | 4.6% |
| Other values (44) | 336018 |
Common
| Value | Count | Frequency (%) |
| 155137 | ||
| , | 10234 | 5.4% |
| . | 7692 | 4.1% |
| + | 2456 | 1.3% |
| - | 1845 | 1.0% |
| 1 | 1216 | 0.6% |
| 0 | 1156 | 0.6% |
| 2 | 887 | 0.5% |
| ; | 838 | 0.4% |
| 5 | 749 | 0.4% |
| Other values (25) | 6352 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1370933 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 155137 | 11.3% | |
| e | 134073 | 9.8% |
| a | 117948 | 8.6% |
| r | 101183 | 7.4% |
| n | 83040 | 6.1% |
| s | 82869 | 6.0% |
| o | 79792 | 5.8% |
| t | 71836 | 5.2% |
| i | 60744 | 4.4% |
| l | 60219 | 4.4% |
| Other values (76) | 424092 |
None
| Value | Count | Frequency (%) |
| é | 1 | |
| ° | 1 | |
| ç | 1 |
locationID
Text
Missing 
| Distinct | 94697 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 983901 |
| Missing (%) | 51.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 23319 |
|---|---|
| Median length | 146 |
| Mean length | 4.468806784 |
| Min length | 1 |
Unique
| Unique | 52902 ? |
|---|---|
| Unique (%) | 5.6% |
Sample
| 1st row | E4 |
|---|---|
| 2nd row | NR 12-4 ID 101 |
| 3rd row | 23 |
| 4th row | 1002 |
| 5th row | 2059 |
| Value | Count | Frequency (%) |
| not | 12390 | 1.2% |
| rec | 12068 | 1.2% |
| 4 | 8477 | 0.8% |
| rhb | 7694 | 0.7% |
| rfb | 7622 | 0.7% |
| 1 | 7590 | 0.7% |
| 2 | 6237 | 0.6% |
| 3 | 5500 | 0.5% |
| gs | 5167 | 0.5% |
| 6 | 5009 | 0.5% |
| Other values (80977) | 965185 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 474193 | 11.3% |
| 2 | 393866 | 9.4% |
| 0 | 331627 | 7.9% |
| 5 | 295816 | 7.0% |
| 3 | 287513 | 6.8% |
| 4 | 264033 | 6.3% |
| - | 262213 | 6.2% |
| 6 | 216530 | 5.1% |
| 7 | 190846 | 4.5% |
| 8 | 180812 | 4.3% |
| Other values (84) | 1312882 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2800702 | |
| Uppercase Letter | 880118 | 20.9% |
| Dash Punctuation | 262221 | 6.2% |
| Space Separator | 99236 | 2.4% |
| Other Punctuation | 76024 | 1.8% |
| Lowercase Letter | 68179 | 1.6% |
| Control | 8490 | 0.2% |
| Connector Punctuation | 7888 | 0.2% |
| Close Punctuation | 3371 | 0.1% |
| Open Punctuation | 3364 | 0.1% |
| Other values (2) | 738 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 92080 | 10.5% |
| S | 79022 | 9.0% |
| C | 71641 | 8.1% |
| B | 66978 | 7.6% |
| R | 60079 | 6.8% |
| M | 56872 | 6.5% |
| N | 52082 | 5.9% |
| E | 48007 | 5.5% |
| I | 44751 | 5.1% |
| T | 36825 | 4.2% |
| Other values (17) | 271781 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9934 | |
| o | 8028 | |
| r | 7686 | |
| a | 7336 | |
| i | 4282 | 6.3% |
| t | 3986 | 5.8% |
| l | 3681 | 5.4% |
| n | 3059 | 4.5% |
| c | 2995 | 4.4% |
| s | 2614 | 3.8% |
| Other values (17) | 14578 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37458 | |
| . | 24829 | |
| , | 7382 | 9.7% |
| / | 3928 | 5.2% |
| # | 1569 | 2.1% |
| & | 290 | 0.4% |
| ? | 151 | 0.2% |
| ; | 133 | 0.2% |
| * | 124 | 0.2% |
| ' | 119 | 0.2% |
| Other values (4) | 41 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 474193 | |
| 2 | 393866 | |
| 0 | 331627 | |
| 5 | 295816 | |
| 3 | 287513 | |
| 4 | 264033 | |
| 6 | 216530 | |
| 7 | 190846 | |
| 8 | 180812 | 6.5% |
| 9 | 165466 | 5.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3081 | |
| ] | 288 | 8.5% |
| } | 2 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3074 | |
| [ | 288 | 8.6% |
| { | 2 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 262213 | |
| – | 8 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 8445 | ||
| 45 | 0.5% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 724 | |
| = | 12 | 1.6% |
Other Number
| Value | Count | Frequency (%) |
| ₁ | 1 | |
| ₂ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 99236 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7888 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3262034 | |
| Latin | 948297 | 22.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 92080 | 9.7% |
| S | 79022 | 8.3% |
| C | 71641 | 7.6% |
| B | 66978 | 7.1% |
| R | 60079 | 6.3% |
| M | 56872 | 6.0% |
| N | 52082 | 5.5% |
| E | 48007 | 5.1% |
| I | 44751 | 4.7% |
| T | 36825 | 3.9% |
| Other values (44) | 339960 |
Common
| Value | Count | Frequency (%) |
| 1 | 474193 | |
| 2 | 393866 | |
| 0 | 331627 | |
| 5 | 295816 | |
| 3 | 287513 | |
| 4 | 264033 | |
| - | 262213 | |
| 6 | 216530 | |
| 7 | 190846 | |
| 8 | 180812 | 5.5% |
| Other values (30) | 364585 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4210319 | |
| Punctuation | 8 | < 0.1% |
| None | 4 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 474193 | 11.3% |
| 2 | 393866 | 9.4% |
| 0 | 331627 | 7.9% |
| 5 | 295816 | 7.0% |
| 3 | 287513 | 6.8% |
| 4 | 264033 | 6.3% |
| - | 262213 | 6.2% |
| 6 | 216530 | 5.1% |
| 7 | 190846 | 4.5% |
| 8 | 180812 | 4.3% |
| Other values (79) | 1312870 |
Punctuation
| Value | Count | Frequency (%) |
| – | 8 |
None
| Value | Count | Frequency (%) |
| ü | 1 | |
| ₁ | 1 | |
| ₂ | 1 | |
| É | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 24.1667 |
|---|
| Value | Count | Frequency (%) |
| 24.1667 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| . | 1 | |
| 1 | 1 | |
| 7 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Other Punctuation | 1 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 7 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| . | 1 | |
| 1 | 1 | |
| 7 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 2 | 1 | |
| 4 | 1 | |
| . | 1 | |
| 1 | 1 | |
| 7 | 1 |
higherGeography
Text
Missing 
| Distinct | 12371 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 67820 |
| Missing (%) | 3.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 126 |
|---|---|
| Median length | 104 |
| Mean length | 36.17331336 |
| Min length | 4 |
Unique
| Unique | 3191 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North Atlantic Ocean, United States |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico, United States, Florida |
| 3rd row | North Atlantic Ocean, Caribbean Sea, Barbados |
| 4th row | North Atlantic Ocean, Gulf of Mexico, United States, Florida |
| 5th row | Philippines |
| Value | Count | Frequency (%) |
| ocean | 1259680 | 13.4% |
| north | 1097942 | 11.7% |
| united | 886041 | 9.4% |
| states | 871462 | 9.3% |
| atlantic | 718171 | 7.7% |
| pacific | 436930 | 4.7% |
| mexico | 248318 | 2.6% |
| of | 243317 | 2.6% |
| gulf | 228723 | 2.4% |
| south | 203297 | 2.2% |
| Other values (4653) | 3190921 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7526561 | 11.2% | |
| a | 6864167 | 10.2% |
| t | 6255693 | 9.3% |
| i | 4779365 | 7.1% |
| e | 4733121 | 7.0% |
| n | 4583640 | 6.8% |
| c | 3759705 | 5.6% |
| o | 2896620 | 4.3% |
| , | 2856796 | 4.2% |
| r | 2271670 | 3.4% |
| Other values (73) | 20691396 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47714802 | |
| Uppercase Letter | 9109346 | 13.6% |
| Space Separator | 7526561 | 11.2% |
| Other Punctuation | 2866960 | 4.3% |
| Dash Punctuation | 1038 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Open Punctuation | 10 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6864167 | |
| t | 6255693 | |
| i | 4779365 | |
| e | 4733121 | |
| n | 4583640 | |
| c | 3759705 | |
| o | 2896620 | 6.1% |
| r | 2271670 | 4.8% |
| s | 2140631 | 4.5% |
| l | 1954823 | 4.1% |
| Other values (28) | 7475367 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1396876 | |
| O | 1301176 | |
| N | 1192784 | |
| A | 1063675 | |
| U | 893787 | |
| P | 682100 | |
| C | 555397 | 6.1% |
| M | 514428 | 5.6% |
| G | 305926 | 3.4% |
| F | 216131 | 2.4% |
| Other values (17) | 987066 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2856796 | |
| . | 7748 | 0.3% |
| ' | 2246 | 0.1% |
| ? | 153 | < 0.1% |
| & | 11 | < 0.1% |
| / | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 8 | 1 | |
| 3 | 1 | |
| 2 | 1 | |
| 0 | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 | |
| ] | 2 | 20.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 | |
| [ | 2 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 7526561 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1038 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56824148 | |
| Common | 10394586 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6864167 | |
| t | 6255693 | 11.0% |
| i | 4779365 | 8.4% |
| e | 4733121 | 8.3% |
| n | 4583640 | 8.1% |
| c | 3759705 | 6.6% |
| o | 2896620 | 5.1% |
| r | 2271670 | 4.0% |
| s | 2140631 | 3.8% |
| l | 1954823 | 3.4% |
| Other values (55) | 16584713 |
Common
| Value | Count | Frequency (%) |
| 7526561 | ||
| , | 2856796 | 27.5% |
| . | 7748 | 0.1% |
| ' | 2246 | < 0.1% |
| - | 1038 | < 0.1% |
| ? | 153 | < 0.1% |
| & | 11 | < 0.1% |
| ) | 8 | < 0.1% |
| ( | 8 | < 0.1% |
| / | 6 | < 0.1% |
| Other values (8) | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 67217783 | |
| None | 951 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7526561 | 11.2% | |
| a | 6864167 | 10.2% |
| t | 6255693 | 9.3% |
| i | 4779365 | 7.1% |
| e | 4733121 | 7.0% |
| n | 4583640 | 6.8% |
| c | 3759705 | 5.6% |
| o | 2896620 | 4.3% |
| , | 2856796 | 4.3% |
| r | 2271670 | 3.4% |
| Other values (60) | 20690445 |
None
| Value | Count | Frequency (%) |
| ç | 434 | |
| í | 144 | 15.1% |
| é | 141 | 14.8% |
| ó | 110 | 11.6% |
| á | 100 | 10.5% |
| ê | 7 | 0.7% |
| è | 6 | 0.6% |
| ô | 3 | 0.3% |
| ü | 2 | 0.2% |
| Ñ | 1 | 0.1% |
| Other values (3) | 3 | 0.3% |
continent
Text
Missing 
| Distinct | 78 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 585602 |
| Missing (%) | 30.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 20 |
| Mean length | 18.7466614 |
| Min length | 4 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North Atlantic Ocean |
| 3rd row | North Atlantic Ocean |
| 4th row | North Atlantic Ocean |
| 5th row | Antarctic Ocean |
| Value | Count | Frequency (%) |
| ocean | 1259206 | |
| north | 1064954 | |
| atlantic | 718109 | |
| pacific | 436889 | 11.4% |
| south | 160769 | 4.2% |
| america | 74593 | 1.9% |
| indian | 50190 | 1.3% |
| antarctic | 43836 | 1.1% |
| arctic | 10182 | 0.3% |
| asia | 8415 | 0.2% |
| Other values (16) | 13313 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 3039386 | |
| t | 2763886 | |
| a | 2600398 | |
| 2499997 | ||
| n | 2126760 | |
| i | 1784659 | 7.1% |
| e | 1339358 | 5.3% |
| O | 1259206 | 5.0% |
| o | 1230891 | 4.9% |
| h | 1225723 | 4.9% |
| Other values (23) | 5258867 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18781428 | |
| Uppercase Letter | 3838823 | 15.3% |
| Space Separator | 2499997 | 9.9% |
| Other Punctuation | 8883 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 3039386 | |
| t | 2763886 | |
| a | 2600398 | |
| n | 2126760 | |
| i | 1784659 | |
| e | 1339358 | |
| o | 1230891 | |
| h | 1225723 | |
| r | 1205299 | 6.4% |
| l | 721634 | 3.8% |
| Other values (10) | 743434 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 1259206 | |
| N | 1064952 | |
| A | 860108 | |
| P | 436889 | 11.4% |
| S | 160772 | 4.2% |
| I | 50190 | 1.3% |
| C | 2777 | 0.1% |
| E | 2768 | 0.1% |
| U | 582 | < 0.1% |
| L | 579 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8873 | |
| ? | 10 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2499997 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22620251 | |
| Common | 2508880 | 10.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 3039386 | |
| t | 2763886 | |
| a | 2600398 | |
| n | 2126760 | |
| i | 1784659 | |
| e | 1339358 | 5.9% |
| O | 1259206 | 5.6% |
| o | 1230891 | 5.4% |
| h | 1225723 | 5.4% |
| r | 1205299 | 5.3% |
| Other values (20) | 4044685 |
Common
| Value | Count | Frequency (%) |
| 2499997 | ||
| , | 8873 | 0.4% |
| ? | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25129131 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 3039386 | |
| t | 2763886 | |
| a | 2600398 | |
| 2499997 | ||
| n | 2126760 | |
| i | 1784659 | 7.1% |
| e | 1339358 | 5.3% |
| O | 1259206 | 5.0% |
| o | 1230891 | 4.9% |
| h | 1225723 | 4.9% |
| Other values (23) | 5258867 |
waterBody
Text
Missing 
| Distinct | 1655 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 666547 |
| Missing (%) | 34.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 76 |
|---|---|
| Median length | 75 |
| Mean length | 24.49177619 |
| Min length | 7 |
Unique
| Unique | 510 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico |
| 3rd row | North Atlantic Ocean, Caribbean Sea |
| 4th row | North Atlantic Ocean, Gulf of Mexico |
| 5th row | Antarctic Ocean |
| Value | Count | Frequency (%) |
| ocean | 1259206 | |
| north | 998360 | |
| atlantic | 718109 | |
| pacific | 436889 | 9.1% |
| of | 231263 | 4.8% |
| gulf | 228590 | 4.7% |
| sea | 193861 | 4.0% |
| mexico | 187715 | 3.9% |
| south | 160358 | 3.3% |
| caribbean | 89343 | 1.9% |
| Other values (1319) | 317960 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3562140 | ||
| c | 3175333 | |
| a | 3112981 | 10.1% |
| t | 2738433 | 8.9% |
| n | 2331193 | 7.6% |
| i | 2082373 | 6.8% |
| e | 1823359 | 5.9% |
| o | 1648016 | 5.3% |
| O | 1260897 | 4.1% |
| r | 1217914 | 3.9% |
| Other values (53) | 7895096 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22242939 | |
| Uppercase Letter | 4590559 | 14.9% |
| Space Separator | 3562140 | 11.5% |
| Other Punctuation | 451817 | 1.5% |
| Dash Punctuation | 276 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 3175333 | |
| a | 3112981 | |
| t | 2738433 | |
| n | 2331193 | |
| i | 2082373 | |
| e | 1823359 | |
| o | 1648016 | |
| r | 1217914 | 5.5% |
| h | 1180259 | 5.3% |
| l | 988338 | 4.4% |
| Other values (20) | 1944740 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 1260897 | |
| N | 1000105 | |
| A | 784132 | |
| P | 450369 | 9.8% |
| S | 386493 | 8.4% |
| G | 231815 | 5.0% |
| M | 210725 | 4.6% |
| C | 120682 | 2.6% |
| B | 53745 | 1.2% |
| I | 51170 | 1.1% |
| Other values (15) | 40426 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 451223 | |
| . | 464 | 0.1% |
| ' | 117 | < 0.1% |
| ? | 13 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3562140 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 276 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 26833498 | |
| Common | 4014237 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 3175333 | |
| a | 3112981 | |
| t | 2738433 | |
| n | 2331193 | 8.7% |
| i | 2082373 | 7.8% |
| e | 1823359 | 6.8% |
| o | 1648016 | 6.1% |
| O | 1260897 | 4.7% |
| r | 1217914 | 4.5% |
| h | 1180259 | 4.4% |
| Other values (45) | 6262740 |
Common
| Value | Count | Frequency (%) |
| 3562140 | ||
| , | 451223 | 11.2% |
| . | 464 | < 0.1% |
| - | 276 | < 0.1% |
| ' | 117 | < 0.1% |
| ? | 13 | < 0.1% |
| [ | 2 | < 0.1% |
| ] | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30847632 | |
| None | 103 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3562140 | ||
| c | 3175333 | |
| a | 3112981 | 10.1% |
| t | 2738433 | 8.9% |
| n | 2331193 | 7.6% |
| i | 2082373 | 6.8% |
| e | 1823359 | 5.9% |
| o | 1648016 | 5.3% |
| O | 1260897 | 4.1% |
| r | 1217914 | 3.9% |
| Other values (49) | 7894993 |
None
| Value | Count | Frequency (%) |
| í | 48 | |
| á | 46 | |
| ó | 6 | 5.8% |
| è | 3 | 2.9% |
islandGroup
Text
Missing 
| Distinct | 20 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 1925291 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 14.52857143 |
| Min length | 5 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Society Islands |
|---|---|
| 2nd row | Society Islands |
| 3rd row | Society Islands |
| 4th row | Society Islands |
| 5th row | Society Islands |
| Value | Count | Frequency (%) |
| islands | 707 | |
| society | 679 | |
| exuma | 20 | 1.3% |
| south | 12 | 0.8% |
| sandwich | 12 | 0.8% |
| florida | 10 | 0.7% |
| keys | 10 | 0.7% |
| pacific | 10 | 0.7% |
| carolina | 8 | 0.5% |
| aleutian | 7 | 0.5% |
| Other values (14) | 28 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | 7.2% |
| l | 751 | 6.7% |
| n | 748 | 6.7% |
| i | 743 | 6.6% |
| d | 738 | 6.6% |
| 733 | 6.6% | |
| o | 722 | 6.5% |
| c | 713 | 6.4% |
| e | 711 | 6.4% |
| Other values (25) | 3079 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8951 | |
| Uppercase Letter | 1503 | 13.4% |
| Space Separator | 733 | 6.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | |
| l | 751 | |
| n | 748 | |
| i | 743 | |
| d | 738 | |
| o | 722 | |
| c | 713 | |
| e | 711 | |
| t | 699 | |
| Other values (11) | 877 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 710 | |
| S | 703 | |
| E | 21 | 1.4% |
| C | 16 | 1.1% |
| P | 12 | 0.8% |
| F | 10 | 0.7% |
| K | 10 | 0.7% |
| A | 7 | 0.5% |
| M | 6 | 0.4% |
| R | 2 | 0.1% |
| Other values (3) | 6 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 733 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10454 | |
| Common | 733 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | 7.7% |
| l | 751 | 7.2% |
| n | 748 | 7.2% |
| i | 743 | 7.1% |
| d | 738 | 7.1% |
| o | 722 | 6.9% |
| c | 713 | 6.8% |
| e | 711 | 6.8% |
| I | 710 | 6.8% |
| Other values (24) | 2369 |
Common
| Value | Count | Frequency (%) |
| 733 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11187 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1446 | |
| a | 803 | 7.2% |
| l | 751 | 6.7% |
| n | 748 | 6.7% |
| i | 743 | 6.6% |
| d | 738 | 6.6% |
| 733 | 6.6% | |
| o | 722 | 6.5% |
| c | 713 | 6.4% |
| e | 711 | 6.4% |
| Other values (25) | 3079 |
island
Text
Missing 
| Distinct | 58 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 1925083 |
| Missing (%) | 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 6 |
| Mean length | 6.676891616 |
| Min length | 4 |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | 3.4% |
Sample
| 1st row | Moorea |
|---|---|
| 2nd row | Moorea |
| 3rd row | Shikoku |
| 4th row | Oahu |
| 5th row | Moorea |
| Value | Count | Frequency (%) |
| moorea | 674 | |
| oahu | 147 | 13.2% |
| island | 91 | 8.2% |
| great | 20 | 1.8% |
| exuma | 20 | 1.8% |
| nunivak | 13 | 1.2% |
| eniwetok | 13 | 1.2% |
| bonaire | 11 | 1.0% |
| key | 10 | 0.9% |
| west | 10 | 0.9% |
| Other values (58) | 106 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| M | 683 | |
| u | 225 | 3.4% |
| n | 186 | 2.8% |
| h | 170 | 2.6% |
| O | 154 | 2.4% |
| 137 | 2.1% | |
| Other values (39) | 977 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5279 | |
| Uppercase Letter | 1113 | 17.0% |
| Space Separator | 137 | 2.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| u | 225 | 4.3% |
| n | 186 | 3.5% |
| h | 170 | 3.2% |
| s | 121 | 2.3% |
| d | 107 | 2.0% |
| l | 105 | 2.0% |
| Other values (16) | 367 | 7.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 683 | |
| O | 154 | 13.8% |
| I | 90 | 8.1% |
| E | 35 | 3.1% |
| G | 23 | 2.1% |
| K | 21 | 1.9% |
| N | 19 | 1.7% |
| S | 19 | 1.7% |
| B | 17 | 1.5% |
| R | 11 | 1.0% |
| Other values (11) | 41 | 3.7% |
Space Separator
| Value | Count | Frequency (%) |
| 137 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6392 | |
| Common | 138 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| M | 683 | |
| u | 225 | 3.5% |
| n | 186 | 2.9% |
| h | 170 | 2.7% |
| O | 154 | 2.4% |
| s | 121 | 1.9% |
| Other values (37) | 855 |
Common
| Value | Count | Frequency (%) |
| 137 | ||
| . | 1 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6528 | |
| None | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1430 | |
| a | 1060 | |
| e | 771 | |
| r | 737 | |
| M | 683 | |
| u | 225 | 3.4% |
| n | 186 | 2.8% |
| h | 170 | 2.6% |
| O | 154 | 2.4% |
| 137 | 2.1% | |
| Other values (38) | 975 |
None
| Value | Count | Frequency (%) |
| á | 2 |
country
Text
Missing 
| Distinct | 353 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 141874 |
| Missing (%) | 7.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 42 |
| Mean length | 10.90559173 |
| Min length | 4 |
Unique
| Unique | 44 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | Barbados |
| 4th row | United States |
| 5th row | Philippines |
| Value | Count | Frequency (%) |
| united | 886037 | |
| states | 871459 | |
| philippines | 93768 | 3.2% |
| mexico | 58629 | 2.0% |
| islands | 48870 | 1.7% |
| panama | 46135 | 1.6% |
| antarctica | 40202 | 1.4% |
| japan | 38460 | 1.3% |
| cuba | 30039 | 1.0% |
| new | 28719 | 1.0% |
| Other values (297) | 747880 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2859159 | |
| e | 2264812 | |
| a | 2126586 | |
| i | 1743725 | |
| n | 1525677 | |
| s | 1262431 | 6.5% |
| d | 1113829 | 5.7% |
| 1106011 | 5.7% | |
| S | 918175 | 4.7% |
| U | 889383 | 4.6% |
| Other values (50) | 3647827 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15479489 | |
| Uppercase Letter | 2868864 | 14.7% |
| Space Separator | 1106011 | 5.7% |
| Other Punctuation | 3203 | < 0.1% |
| Dash Punctuation | 48 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2859159 | |
| e | 2264812 | |
| a | 2126586 | |
| i | 1743725 | |
| n | 1525677 | |
| s | 1262431 | |
| d | 1113829 | 7.2% |
| l | 405876 | 2.6% |
| r | 307232 | 2.0% |
| o | 303838 | 2.0% |
| Other values (19) | 1566324 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 918175 | |
| U | 889383 | |
| P | 203327 | 7.1% |
| M | 115539 | 4.0% |
| C | 114327 | 4.0% |
| A | 102386 | 3.6% |
| I | 86019 | 3.0% |
| B | 74941 | 2.6% |
| J | 65993 | 2.3% |
| F | 45409 | 1.6% |
| Other values (15) | 253365 | 8.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3089 | |
| ? | 94 | 2.9% |
| , | 18 | 0.6% |
| ' | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1106011 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 48 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18348353 | |
| Common | 1109262 | 5.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2859159 | |
| e | 2264812 | |
| a | 2126586 | |
| i | 1743725 | |
| n | 1525677 | |
| s | 1262431 | 6.9% |
| d | 1113829 | 6.1% |
| S | 918175 | 5.0% |
| U | 889383 | 4.8% |
| l | 405876 | 2.2% |
| Other values (44) | 3238700 |
Common
| Value | Count | Frequency (%) |
| 1106011 | ||
| . | 3089 | 0.3% |
| ? | 94 | < 0.1% |
| - | 48 | < 0.1% |
| , | 18 | < 0.1% |
| ' | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19457162 | |
| None | 453 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 2859159 | |
| e | 2264812 | |
| a | 2126586 | |
| i | 1743725 | |
| n | 1525677 | |
| s | 1262431 | 6.5% |
| d | 1113829 | 5.7% |
| 1106011 | 5.7% | |
| S | 918175 | 4.7% |
| U | 889383 | 4.6% |
| Other values (47) | 3647374 |
None
| Value | Count | Frequency (%) |
| ç | 433 | |
| é | 18 | 4.0% |
| ô | 2 | 0.4% |
countryCode
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 24 10 00 N |
|---|
| Value | Count | Frequency (%) |
| 24 | 1 | |
| 10 | 1 | |
| 00 | 1 | |
| n | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | ||
| 0 | 3 | |
| 2 | 1 | 10.0% |
| 4 | 1 | 10.0% |
| 1 | 1 | 10.0% |
| N | 1 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Space Separator | 3 | |
| Uppercase Letter | 1 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 2 | 1 | 16.7% |
| 4 | 1 | 16.7% |
| 1 | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9 | |
| Latin | 1 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | ||
| 0 | 3 | |
| 2 | 1 | 11.1% |
| 4 | 1 | 11.1% |
| 1 | 1 | 11.1% |
Latin
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | ||
| 0 | 3 | |
| 2 | 1 | 10.0% |
| 4 | 1 | 10.0% |
| 1 | 1 | 10.0% |
| N | 1 | 10.0% |
stateProvince
Text
Missing 
| Distinct | 1327 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 943504 |
| Missing (%) | 49.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 51 |
|---|---|
| Median length | 39 |
| Mean length | 9.182602129 |
| Min length | 3 |
Unique
| Unique | 282 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Florida |
| 3rd row | Massachusetts |
| 4th row | Quezon |
| 5th row | Newfoundland |
| Value | Count | Frequency (%) |
| florida | 157954 | 13.1% |
| massachusetts | 103360 | 8.6% |
| california | 57075 | 4.7% |
| carolina | 53916 | 4.5% |
| texas | 43585 | 3.6% |
| alaska | 41853 | 3.5% |
| north | 31985 | 2.7% |
| louisiana | 28639 | 2.4% |
| hawaii | 26395 | 2.2% |
| south | 26207 | 2.2% |
| Other values (1254) | 634930 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1427698 | |
| i | 808867 | 9.0% |
| s | 773106 | 8.6% |
| o | 650775 | 7.2% |
| r | 519350 | 5.8% |
| l | 506572 | 5.6% |
| n | 498591 | 5.5% |
| e | 457545 | 5.1% |
| t | 400549 | 4.4% |
| u | 277270 | 3.1% |
| Other values (63) | 2702107 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7610434 | |
| Uppercase Letter | 1183054 | 13.1% |
| Space Separator | 223342 | 2.5% |
| Other Punctuation | 5088 | 0.1% |
| Dash Punctuation | 488 | < 0.1% |
| Close Punctuation | 8 | < 0.1% |
| Open Punctuation | 8 | < 0.1% |
| Decimal Number | 7 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1427698 | |
| i | 808867 | |
| s | 773106 | |
| o | 650775 | |
| r | 519350 | 6.8% |
| l | 506572 | 6.7% |
| n | 498591 | 6.6% |
| e | 457545 | 6.0% |
| t | 400549 | 5.3% |
| u | 277270 | 3.6% |
| Other values (24) | 1290111 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 171112 | |
| C | 165183 | |
| F | 164672 | |
| A | 80857 | 6.8% |
| N | 78765 | 6.7% |
| T | 76122 | 6.4% |
| S | 72411 | 6.1% |
| I | 44680 | 3.8% |
| G | 38390 | 3.2% |
| L | 36079 | 3.0% |
| Other values (17) | 254783 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4592 | |
| . | 302 | 5.9% |
| ' | 148 | 2.9% |
| ? | 46 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 0 | 3 | |
| 7 | 1 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 223342 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 488 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8793488 | |
| Common | 228942 | 2.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1427698 | |
| i | 808867 | 9.2% |
| s | 773106 | 8.8% |
| o | 650775 | 7.4% |
| r | 519350 | 5.9% |
| l | 506572 | 5.8% |
| n | 498591 | 5.7% |
| e | 457545 | 5.2% |
| t | 400549 | 4.6% |
| u | 277270 | 3.2% |
| Other values (51) | 2473165 |
Common
| Value | Count | Frequency (%) |
| 223342 | ||
| , | 4592 | 2.0% |
| - | 488 | 0.2% |
| . | 302 | 0.1% |
| ' | 148 | 0.1% |
| ? | 46 | < 0.1% |
| ) | 8 | < 0.1% |
| ( | 8 | < 0.1% |
| 1 | 3 | < 0.1% |
| 0 | 3 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9022046 | |
| None | 384 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1427698 | |
| i | 808867 | 9.0% |
| s | 773106 | 8.6% |
| o | 650775 | 7.2% |
| r | 519350 | 5.8% |
| l | 506572 | 5.6% |
| n | 498591 | 5.5% |
| e | 457545 | 5.1% |
| t | 400549 | 4.4% |
| u | 277270 | 3.1% |
| Other values (54) | 2701723 |
None
| Value | Count | Frequency (%) |
| é | 123 | |
| ó | 101 | |
| í | 96 | |
| á | 52 | |
| ê | 7 | 1.8% |
| è | 2 | 0.5% |
| Ñ | 1 | 0.3% |
| ú | 1 | 0.3% |
| ô | 1 | 0.3% |
county
Text
Missing 
| Distinct | 2594 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 1786110 |
| Missing (%) | 92.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 43 |
| Mean length | 14.35967589 |
| Min length | 3 |
Unique
| Unique | 558 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Cumberland County |
|---|---|
| 2nd row | Allamakee County |
| 3rd row | St. Lucie County |
| 4th row | Delaware County |
| 5th row | Kimble County |
| Value | Count | Frequency (%) |
| county | 135403 | |
| st | 3893 | 1.3% |
| parish | 3202 | 1.1% |
| monroe | 3116 | 1.0% |
| lucie | 2649 | 0.9% |
| montgomery | 2553 | 0.9% |
| san | 2117 | 0.7% |
| prince | 1875 | 0.6% |
| george's | 1763 | 0.6% |
| jackson | 1747 | 0.6% |
| Other values (2256) | 139854 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 223731 | |
| o | 216808 | |
| t | 181020 | 9.0% |
| u | 160899 | 8.0% |
| 158221 | 7.9% | |
| C | 152389 | 7.6% |
| y | 151799 | 7.6% |
| e | 105720 | 5.3% |
| a | 103249 | 5.1% |
| r | 74006 | 3.7% |
| Other values (55) | 481809 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1546924 | |
| Uppercase Letter | 298370 | 14.8% |
| Space Separator | 158221 | 7.9% |
| Other Punctuation | 5911 | 0.3% |
| Dash Punctuation | 225 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 223731 | |
| o | 216808 | |
| t | 181020 | |
| u | 160899 | |
| y | 151799 | |
| e | 105720 | |
| a | 103249 | |
| r | 74006 | 4.8% |
| i | 55521 | 3.6% |
| l | 50143 | 3.2% |
| Other values (22) | 224028 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 152389 | |
| M | 16354 | 5.5% |
| S | 14112 | 4.7% |
| L | 13052 | 4.4% |
| P | 12733 | 4.3% |
| B | 11991 | 4.0% |
| G | 8959 | 3.0% |
| W | 8632 | 2.9% |
| A | 8278 | 2.8% |
| D | 7831 | 2.6% |
| Other values (16) | 44039 | 14.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3891 | |
| ' | 1979 | |
| , | 24 | 0.4% |
| & | 11 | 0.2% |
| / | 6 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 158221 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 225 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1845294 | |
| Common | 164357 | 8.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 223731 | |
| o | 216808 | |
| t | 181020 | |
| u | 160899 | 8.7% |
| C | 152389 | 8.3% |
| y | 151799 | 8.2% |
| e | 105720 | 5.7% |
| a | 103249 | 5.6% |
| r | 74006 | 4.0% |
| i | 55521 | 3.0% |
| Other values (48) | 420152 |
Common
| Value | Count | Frequency (%) |
| 158221 | ||
| . | 3891 | 2.4% |
| ' | 1979 | 1.2% |
| - | 225 | 0.1% |
| , | 24 | < 0.1% |
| & | 11 | < 0.1% |
| / | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2009642 | |
| None | 9 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 223731 | |
| o | 216808 | |
| t | 181020 | 9.0% |
| u | 160899 | 8.0% |
| 158221 | 7.9% | |
| C | 152389 | 7.6% |
| y | 151799 | 7.6% |
| e | 105720 | 5.3% |
| a | 103249 | 5.1% |
| r | 74006 | 3.7% |
| Other values (49) | 481800 |
None
| Value | Count | Frequency (%) |
| ó | 3 | |
| ü | 2 | |
| ñ | 1 | 11.1% |
| ç | 1 | 11.1% |
| ø | 1 | 11.1% |
| è | 1 | 11.1% |
locality
Text
Missing 
| Distinct | 204716 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 642266 |
| Missing (%) | 33.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 13524 |
|---|---|
| Median length | 378 |
| Mean length | 28.98951702 |
| Min length | 1 |
Unique
| Unique | 126299 ? |
|---|---|
| Unique (%) | 9.8% |
Sample
| 1st row | off Delaware |
|---|---|
| 2nd row | W Coast |
| 3rd row | Cape Sable, West Of |
| 4th row | Antarctic Peninsula |
| 5th row | Georges Bank |
| Value | Count | Frequency (%) |
| island | 342298 | 5.6% |
| of | 336380 | 5.5% |
| off | 252624 | 4.1% |
| bay | 137509 | 2.2% |
| islands | 98135 | 1.6% |
| bank | 84580 | 1.4% |
| south | 74622 | 1.2% |
| georges | 66648 | 1.1% |
| florida | 63420 | 1.0% |
| river | 63361 | 1.0% |
| Other values (77108) | 4634261 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4868751 | 13.1% | |
| a | 3497602 | 9.4% |
| e | 2450745 | 6.6% |
| o | 2296346 | 6.2% |
| n | 2154517 | 5.8% |
| r | 1674260 | 4.5% |
| s | 1628596 | 4.4% |
| i | 1597480 | 4.3% |
| l | 1584073 | 4.3% |
| t | 1475574 | 4.0% |
| Other values (129) | 13988653 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25258393 | |
| Uppercase Letter | 5372235 | 14.4% |
| Space Separator | 4868751 | 13.1% |
| Other Punctuation | 1209015 | 3.2% |
| Decimal Number | 423706 | 1.1% |
| Dash Punctuation | 41185 | 0.1% |
| Open Punctuation | 15167 | < 0.1% |
| Close Punctuation | 15038 | < 0.1% |
| Control | 7284 | < 0.1% |
| Math Symbol | 5035 | < 0.1% |
| Other values (7) | 788 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3497602 | |
| e | 2450745 | |
| o | 2296346 | 9.1% |
| n | 2154517 | 8.5% |
| r | 1674260 | 6.6% |
| s | 1628596 | 6.4% |
| i | 1597480 | 6.3% |
| l | 1584073 | 6.3% |
| t | 1475574 | 5.8% |
| d | 1017847 | 4.0% |
| Other values (49) | 5881353 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 539946 | 10.1% |
| I | 501583 | 9.3% |
| B | 475968 | 8.9% |
| C | 466565 | 8.7% |
| O | 360035 | 6.7% |
| P | 312753 | 5.8% |
| M | 279542 | 5.2% |
| R | 262521 | 4.9% |
| L | 254760 | 4.7% |
| A | 250943 | 4.7% |
| Other values (19) | 1667619 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 986732 | |
| . | 147275 | 12.2% |
| ' | 31551 | 2.6% |
| ; | 24393 | 2.0% |
| / | 8160 | 0.7% |
| # | 2752 | 0.2% |
| & | 2520 | 0.2% |
| : | 2297 | 0.2% |
| " | 2103 | 0.2% |
| ? | 1193 | 0.1% |
| Other values (6) | 39 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 82668 | |
| 0 | 70820 | |
| 2 | 57121 | |
| 5 | 49646 | |
| 3 | 39680 | |
| 4 | 31892 | 7.5% |
| 6 | 30555 | 7.2% |
| 7 | 22055 | 5.2% |
| 8 | 20481 | 4.8% |
| 9 | 18788 | 4.4% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4129 | |
| > | 403 | 8.0% |
| = | 375 | 7.4% |
| ~ | 121 | 2.4% |
| < | 3 | 0.1% |
| | | 2 | < 0.1% |
| ± | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 14360 | |
| [ | 789 | 5.2% |
| { | 18 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 14252 | |
| ] | 776 | 5.2% |
| } | 10 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41184 | |
| – | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 7245 | ||
| 39 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 4868751 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 762 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 14 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 6 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 3 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30630628 | |
| Common | 6585969 | 17.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3497602 | 11.4% |
| e | 2450745 | 8.0% |
| o | 2296346 | 7.5% |
| n | 2154517 | 7.0% |
| r | 1674260 | 5.5% |
| s | 1628596 | 5.3% |
| i | 1597480 | 5.2% |
| l | 1584073 | 5.2% |
| t | 1475574 | 4.8% |
| d | 1017847 | 3.3% |
| Other values (78) | 11253588 |
Common
| Value | Count | Frequency (%) |
| 4868751 | ||
| , | 986732 | 15.0% |
| . | 147275 | 2.2% |
| 1 | 82668 | 1.3% |
| 0 | 70820 | 1.1% |
| 2 | 57121 | 0.9% |
| 5 | 49646 | 0.8% |
| - | 41184 | 0.6% |
| 3 | 39680 | 0.6% |
| 4 | 31892 | 0.5% |
| Other values (41) | 210200 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37214633 | |
| None | 1958 | < 0.1% |
| Modifier Letters | 3 | < 0.1% |
| Punctuation | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4868751 | 13.1% | |
| a | 3497602 | 9.4% |
| e | 2450745 | 6.6% |
| o | 2296346 | 6.2% |
| n | 2154517 | 5.8% |
| r | 1674260 | 4.5% |
| s | 1628596 | 4.4% |
| i | 1597480 | 4.3% |
| l | 1584073 | 4.3% |
| t | 1475574 | 4.0% |
| Other values (86) | 13986689 |
None
| Value | Count | Frequency (%) |
| ° | 762 | |
| é | 230 | 11.7% |
| ã | 187 | 9.6% |
| á | 141 | 7.2% |
| ó | 138 | 7.0% |
| í | 109 | 5.6% |
| ñ | 78 | 4.0% |
| ú | 55 | 2.8% |
| ç | 36 | 1.8% |
| ī | 36 | 1.8% |
| Other values (29) | 186 | 9.5% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 3 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | |
| ” | 1 | |
| – | 1 |
Missing 
| Distinct | 1038 |
|---|---|
| Distinct (%) | 15.3% |
| Missing | 1919257 |
| Missing (%) | 99.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.359347443 |
| Min length | 3 |
Unique
| Unique | 394 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | 783.0 |
|---|---|
| 2nd row | 15.0 |
| 3rd row | 135.0 |
| 4th row | 4070.0 |
| 5th row | 870.0 |
| Value | Count | Frequency (%) |
| 1981.0 | 618 | 9.1% |
| 135.0 | 196 | 2.9% |
| 350.0 | 165 | 2.4% |
| 348.0 | 125 | 1.8% |
| 164.0 | 123 | 1.8% |
| 149.0 | 117 | 1.7% |
| 309.0 | 116 | 1.7% |
| 388.0 | 85 | 1.2% |
| 988.0 | 82 | 1.2% |
| 1100.0 | 72 | 1.1% |
| Other values (1028) | 5105 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9229 | |
| . | 6804 | |
| 1 | 4981 | |
| 2 | 2401 | 6.6% |
| 8 | 2371 | 6.5% |
| 3 | 2286 | 6.3% |
| 9 | 2000 | 5.5% |
| 5 | 1844 | 5.1% |
| 4 | 1684 | 4.6% |
| 7 | 1453 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 29661 | |
| Other Punctuation | 6804 | 18.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9229 | |
| 1 | 4981 | |
| 2 | 2401 | 8.1% |
| 8 | 2371 | 8.0% |
| 3 | 2286 | 7.7% |
| 9 | 2000 | 6.7% |
| 5 | 1844 | 6.2% |
| 4 | 1684 | 5.7% |
| 7 | 1453 | 4.9% |
| 6 | 1412 | 4.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 36465 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9229 | |
| . | 6804 | |
| 1 | 4981 | |
| 2 | 2401 | 6.6% |
| 8 | 2371 | 6.5% |
| 3 | 2286 | 6.3% |
| 9 | 2000 | 5.5% |
| 5 | 1844 | 5.1% |
| 4 | 1684 | 4.6% |
| 7 | 1453 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36465 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9229 | |
| . | 6804 | |
| 1 | 4981 | |
| 2 | 2401 | 6.6% |
| 8 | 2371 | 6.5% |
| 3 | 2286 | 6.3% |
| 9 | 2000 | 5.5% |
| 5 | 1844 | 5.1% |
| 4 | 1684 | 4.6% |
| 7 | 1453 | 4.0% |
Missing 
| Distinct | 725 |
|---|---|
| Distinct (%) | 20.6% |
| Missing | 1922544 |
| Missing (%) | 99.8% |
| Memory size | 14.7 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.355416548 |
| Min length | 3 |
Unique
| Unique | 261 ? |
|---|---|
| Unique (%) | 7.4% |
Sample
| 1st row | 783.0 |
|---|---|
| 2nd row | 15.0 |
| 3rd row | 185.0 |
| 4th row | 870.0 |
| 5th row | 853.0 |
| Value | Count | Frequency (%) |
| 185.0 | 198 | 5.6% |
| 914.0 | 57 | 1.6% |
| 1524.0 | 48 | 1.4% |
| 1100.0 | 45 | 1.3% |
| 610.0 | 40 | 1.1% |
| 1219.0 | 37 | 1.1% |
| 1829.0 | 34 | 1.0% |
| 2.0 | 33 | 0.9% |
| 1372.0 | 33 | 0.9% |
| 65.0 | 32 | 0.9% |
| Other values (715) | 2960 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5068 | |
| . | 3516 | |
| 1 | 2463 | |
| 2 | 1466 | 7.8% |
| 5 | 1254 | 6.7% |
| 3 | 983 | 5.2% |
| 8 | 942 | 5.0% |
| 6 | 851 | 4.5% |
| 4 | 837 | 4.4% |
| 7 | 736 | 3.9% |
| Other values (2) | 719 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 15318 | |
| Other Punctuation | 3516 | 18.7% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5068 | |
| 1 | 2463 | |
| 2 | 1466 | 9.6% |
| 5 | 1254 | 8.2% |
| 3 | 983 | 6.4% |
| 8 | 942 | 6.1% |
| 6 | 851 | 5.6% |
| 4 | 837 | 5.5% |
| 7 | 736 | 4.8% |
| 9 | 718 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3516 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18835 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5068 | |
| . | 3516 | |
| 1 | 2463 | |
| 2 | 1466 | 7.8% |
| 5 | 1254 | 6.7% |
| 3 | 983 | 5.2% |
| 8 | 942 | 5.0% |
| 6 | 851 | 4.5% |
| 4 | 837 | 4.4% |
| 7 | 736 | 3.9% |
| Other values (2) | 719 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18835 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5068 | |
| . | 3516 | |
| 1 | 2463 | |
| 2 | 1466 | 7.8% |
| 5 | 1254 | 6.7% |
| 3 | 983 | 5.2% |
| 8 | 942 | 5.0% |
| 6 | 851 | 4.5% |
| 4 | 837 | 4.4% |
| 7 | 736 | 3.9% |
| Other values (2) | 719 | 3.8% |
Missing 
| Distinct | 126 |
|---|---|
| Distinct (%) | 27.3% |
| Missing | 1925599 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 4 |
| Mean length | 10.17099567 |
| Min length | 4 |
Unique
| Unique | 65 ? |
|---|---|
| Unique (%) | 14.1% |
Sample
| 1st row | 7000 |
|---|---|
| 2nd row | 4070 m.a.s.l. |
| 3rd row | 4200-4400 |
| 4th row | 2009 +/- 20.1 feet |
| 5th row | 3000 |
| Value | Count | Frequency (%) |
| collected | 53 | 5.6% |
| on | 53 | 5.6% |
| and | 51 | 5.4% |
| flat | 50 | 5.3% |
| lagoon | 50 | 5.3% |
| slope | 50 | 5.3% |
| m | 27 | 2.8% |
| 3800 | 23 | 2.4% |
| 2550 | 21 | 2.2% |
| above | 19 | 2.0% |
| Other values (148) | 554 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 660 | |
| 489 | 10.4% | |
| l | 346 | 7.4% |
| e | 330 | 7.0% |
| o | 320 | 6.8% |
| a | 237 | 5.0% |
| 3 | 219 | 4.7% |
| 5 | 218 | 4.6% |
| t | 202 | 4.3% |
| n | 193 | 4.1% |
| Other values (41) | 1485 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2418 | |
| Decimal Number | 1576 | |
| Space Separator | 489 | 10.4% |
| Other Punctuation | 72 | 1.5% |
| Uppercase Letter | 70 | 1.5% |
| Dash Punctuation | 34 | 0.7% |
| Open Punctuation | 17 | 0.4% |
| Close Punctuation | 17 | 0.4% |
| Math Symbol | 6 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 346 | |
| e | 330 | |
| o | 320 | |
| a | 237 | |
| t | 202 | |
| n | 193 | |
| s | 124 | 5.1% |
| d | 118 | 4.9% |
| f | 84 | 3.5% |
| m | 80 | 3.3% |
| Other values (13) | 384 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 660 | |
| 3 | 219 | 13.9% |
| 5 | 218 | 13.8% |
| 2 | 127 | 8.1% |
| 4 | 98 | 6.2% |
| 8 | 74 | 4.7% |
| 1 | 68 | 4.3% |
| 7 | 48 | 3.0% |
| 9 | 43 | 2.7% |
| 6 | 21 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 40 | |
| ' | 20 | |
| ? | 6 | 8.3% |
| , | 3 | 4.2% |
| / | 2 | 2.8% |
| ; | 1 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 51 | |
| E | 15 | 21.4% |
| A | 2 | 2.9% |
| I | 1 | 1.4% |
| T | 1 | 1.4% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 3 | |
| + | 2 | |
| > | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 489 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 34 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 17 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 17 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2488 | |
| Common | 2211 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 346 | |
| e | 330 | |
| o | 320 | |
| a | 237 | |
| t | 202 | |
| n | 193 | |
| s | 124 | 5.0% |
| d | 118 | 4.7% |
| f | 84 | 3.4% |
| m | 80 | 3.2% |
| Other values (18) | 454 |
Common
| Value | Count | Frequency (%) |
| 0 | 660 | |
| 489 | ||
| 3 | 219 | 9.9% |
| 5 | 218 | 9.9% |
| 2 | 127 | 5.7% |
| 4 | 98 | 4.4% |
| 8 | 74 | 3.3% |
| 1 | 68 | 3.1% |
| 7 | 48 | 2.2% |
| 9 | 43 | 1.9% |
| Other values (13) | 167 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4699 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 660 | |
| 489 | 10.4% | |
| l | 346 | 7.4% |
| e | 330 | 7.0% |
| o | 320 | 6.8% |
| a | 237 | 5.0% |
| 3 | 219 | 4.7% |
| 5 | 218 | 4.6% |
| t | 202 | 4.3% |
| n | 193 | 4.1% |
| Other values (41) | 1485 |
verticalDatum
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 152 |
|---|
| Value | Count | Frequency (%) |
| 152 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 5 | 1 | |
| 2 | 1 |
Missing 
| Distinct | 6902 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 1143588 |
| Missing (%) | 59.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 4.378034769 |
| Min length | 3 |
Unique
| Unique | 2028 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 77.0 |
|---|---|
| 2nd row | 50.0 |
| 3rd row | 74.0 |
| 4th row | 265.0 |
| 5th row | 75.0 |
| Value | Count | Frequency (%) |
| 0.0 | 45226 | 5.8% |
| 1.0 | 16089 | 2.1% |
| 18.0 | 10809 | 1.4% |
| 2.0 | 9889 | 1.3% |
| 15.0 | 9294 | 1.2% |
| 84.0 | 9270 | 1.2% |
| 82.0 | 8938 | 1.1% |
| 3.0 | 8672 | 1.1% |
| 27.0 | 8646 | 1.1% |
| 55.0 | 8479 | 1.1% |
| Other values (6887) | 647161 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 981792 | |
| . | 782472 | |
| 1 | 321150 | 9.4% |
| 2 | 239243 | 7.0% |
| 5 | 194428 | 5.7% |
| 3 | 185474 | 5.4% |
| 4 | 175086 | 5.1% |
| 8 | 152567 | 4.5% |
| 6 | 145116 | 4.2% |
| 7 | 129322 | 3.8% |
| Other values (2) | 119044 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2643180 | |
| Other Punctuation | 782472 | 22.8% |
| Dash Punctuation | 42 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 981792 | |
| 1 | 321150 | 12.2% |
| 2 | 239243 | 9.1% |
| 5 | 194428 | 7.4% |
| 3 | 185474 | 7.0% |
| 4 | 175086 | 6.6% |
| 8 | 152567 | 5.8% |
| 6 | 145116 | 5.5% |
| 7 | 129322 | 4.9% |
| 9 | 119002 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 782472 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3425694 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 981792 | |
| . | 782472 | |
| 1 | 321150 | 9.4% |
| 2 | 239243 | 7.0% |
| 5 | 194428 | 5.7% |
| 3 | 185474 | 5.4% |
| 4 | 175086 | 5.1% |
| 8 | 152567 | 4.5% |
| 6 | 145116 | 4.2% |
| 7 | 129322 | 3.8% |
| Other values (2) | 119044 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3425694 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 981792 | |
| . | 782472 | |
| 1 | 321150 | 9.4% |
| 2 | 239243 | 7.0% |
| 5 | 194428 | 5.7% |
| 3 | 185474 | 5.4% |
| 4 | 175086 | 5.1% |
| 8 | 152567 | 4.5% |
| 6 | 145116 | 4.2% |
| 7 | 129322 | 3.8% |
| Other values (2) | 119044 | 3.5% |
Missing 
| Distinct | 6653 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 1205034 |
| Missing (%) | 62.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 4.453148079 |
| Min length | 3 |
Unique
| Unique | 1921 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 77.0 |
|---|---|
| 2nd row | 400.0 |
| 3rd row | 74.0 |
| 4th row | 265.0 |
| 5th row | 75.0 |
| Value | Count | Frequency (%) |
| 1.0 | 27351 | 3.8% |
| 2.0 | 10561 | 1.5% |
| 18.0 | 9723 | 1.3% |
| 84.0 | 9131 | 1.3% |
| 3.0 | 8394 | 1.2% |
| 55.0 | 7481 | 1.0% |
| 27.0 | 7196 | 1.0% |
| 37.0 | 6993 | 1.0% |
| 5.0 | 6769 | 0.9% |
| 0.0 | 6747 | 0.9% |
| Other values (6638) | 620681 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 885387 | |
| . | 721026 | |
| 1 | 323911 | 10.1% |
| 2 | 234786 | 7.3% |
| 5 | 184685 | 5.8% |
| 3 | 176891 | 5.5% |
| 4 | 166718 | 5.2% |
| 8 | 143338 | 4.5% |
| 6 | 138529 | 4.3% |
| 7 | 123043 | 3.8% |
| Other values (2) | 112526 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2489772 | |
| Other Punctuation | 721026 | 22.5% |
| Dash Punctuation | 42 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 885387 | |
| 1 | 323911 | 13.0% |
| 2 | 234786 | 9.4% |
| 5 | 184685 | 7.4% |
| 3 | 176891 | 7.1% |
| 4 | 166718 | 6.7% |
| 8 | 143338 | 5.8% |
| 6 | 138529 | 5.6% |
| 7 | 123043 | 4.9% |
| 9 | 112484 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 721026 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3210840 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 885387 | |
| . | 721026 | |
| 1 | 323911 | 10.1% |
| 2 | 234786 | 7.3% |
| 5 | 184685 | 5.8% |
| 3 | 176891 | 5.5% |
| 4 | 166718 | 5.2% |
| 8 | 143338 | 4.5% |
| 6 | 138529 | 4.3% |
| 7 | 123043 | 3.8% |
| Other values (2) | 112526 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3210840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 885387 | |
| . | 721026 | |
| 1 | 323911 | 10.1% |
| 2 | 234786 | 7.3% |
| 5 | 184685 | 5.8% |
| 3 | 176891 | 5.5% |
| 4 | 166718 | 5.2% |
| 8 | 143338 | 4.5% |
| 6 | 138529 | 4.3% |
| 7 | 123043 | 3.8% |
| Other values (2) | 112526 | 3.5% |
verbatimDepth
Text
Missing 
| Distinct | 1531 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 1899821 |
| Missing (%) | 98.6% |
| Memory size | 14.7 MiB |
Length
| Max length | 99 |
|---|---|
| Median length | 91 |
| Mean length | 13.4351753 |
| Min length | 1 |
Unique
| Unique | 722 ? |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | Surface |
|---|---|
| 2nd row | max depth 1772 ft |
| 3rd row | surface |
| 4th row | Intertidal |
| 5th row | Intertidal |
| Value | Count | Frequency (%) |
| intertidal | 11930 | |
| surface | 4084 | 8.0% |
| recorded | 2869 | 5.6% |
| depths | 2848 | 5.6% |
| multiple | 2844 | 5.6% |
| shore | 1165 | 2.3% |
| 0-300 | 1120 | 2.2% |
| 0 | 1067 | 2.1% |
| depth | 1023 | 2.0% |
| low | 964 | 1.9% |
| Other values (1043) | 21001 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 36679 | 10.4% |
| e | 35131 | 10.0% |
| r | 25384 | 7.2% |
| 24675 | 7.0% | |
| d | 24169 | 6.9% |
| l | 20645 | 5.9% |
| a | 20478 | 5.8% |
| i | 19388 | 5.5% |
| 0 | 16025 | 4.5% |
| n | 14725 | 4.2% |
| Other values (69) | 115240 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 250835 | |
| Decimal Number | 39286 | 11.1% |
| Space Separator | 24675 | 7.0% |
| Uppercase Letter | 19955 | 5.7% |
| Other Punctuation | 12435 | 3.5% |
| Dash Punctuation | 4880 | 1.4% |
| Math Symbol | 236 | 0.1% |
| Open Punctuation | 118 | < 0.1% |
| Close Punctuation | 118 | < 0.1% |
| Final Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 36679 | |
| e | 35131 | |
| r | 25384 | |
| d | 24169 | |
| l | 20645 | |
| a | 20478 | |
| i | 19388 | |
| n | 14725 | 5.9% |
| c | 8173 | 3.3% |
| p | 7641 | 3.0% |
| Other values (15) | 38422 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 10832 | |
| S | 4488 | |
| M | 2984 | 15.0% |
| L | 758 | 3.8% |
| T | 218 | 1.1% |
| B | 109 | 0.5% |
| H | 83 | 0.4% |
| D | 78 | 0.4% |
| C | 73 | 0.4% |
| Z | 59 | 0.3% |
| Other values (14) | 273 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5996 | |
| : | 3686 | |
| . | 1398 | 11.2% |
| " | 841 | 6.8% |
| ; | 207 | 1.7% |
| ' | 201 | 1.6% |
| @ | 43 | 0.3% |
| / | 29 | 0.2% |
| & | 22 | 0.2% |
| ? | 10 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 16025 | |
| 1 | 4889 | 12.4% |
| 2 | 3728 | 9.5% |
| 3 | 3378 | 8.6% |
| 5 | 2938 | 7.5% |
| 8 | 2555 | 6.5% |
| 4 | 1746 | 4.4% |
| 6 | 1719 | 4.4% |
| 7 | 1431 | 3.6% |
| 9 | 877 | 2.2% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 138 | |
| = | 60 | |
| + | 24 | 10.2% |
| ~ | 14 | 5.9% |
Space Separator
| Value | Count | Frequency (%) |
| 24675 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4880 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 118 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 118 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 270790 | |
| Common | 81749 | 23.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 36679 | |
| e | 35131 | |
| r | 25384 | |
| d | 24169 | |
| l | 20645 | 7.6% |
| a | 20478 | 7.6% |
| i | 19388 | 7.2% |
| n | 14725 | 5.4% |
| I | 10832 | 4.0% |
| c | 8173 | 3.0% |
| Other values (39) | 55186 |
Common
| Value | Count | Frequency (%) |
| 24675 | ||
| 0 | 16025 | |
| , | 5996 | 7.3% |
| 1 | 4889 | 6.0% |
| - | 4880 | 6.0% |
| 2 | 3728 | 4.6% |
| : | 3686 | 4.5% |
| 3 | 3378 | 4.1% |
| 5 | 2938 | 3.6% |
| 8 | 2555 | 3.1% |
| Other values (20) | 8999 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 352538 | |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 36679 | 10.4% |
| e | 35131 | 10.0% |
| r | 25384 | 7.2% |
| 24675 | 7.0% | |
| d | 24169 | 6.9% |
| l | 20645 | 5.9% |
| a | 20478 | 5.8% |
| i | 19388 | 5.5% |
| 0 | 16025 | 4.5% |
| n | 14725 | 4.2% |
| Other values (68) | 115239 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
decimalLatitude
Text
Missing 
| Distinct | 70077 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 927243 |
| Missing (%) | 48.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.23583776 |
| Min length | 3 |
Unique
| Unique | 26221 ? |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | 38.7117 |
|---|---|
| 2nd row | 25.2819 |
| 3rd row | -62.667 |
| 4th row | 42.0833 |
| 5th row | 13.7792 |
| Value | Count | Frequency (%) |
| 25.58 | 10487 | 1.0% |
| 40.6583 | 8820 | 0.9% |
| 26.17 | 7319 | 0.7% |
| 26.5 | 5191 | 0.5% |
| 26.97 | 3956 | 0.4% |
| 25.7883 | 3456 | 0.3% |
| 9.4 | 3108 | 0.3% |
| 9.37 | 2976 | 0.3% |
| 40.895 | 2589 | 0.3% |
| 40.66 | 2520 | 0.3% |
| Other values (65547) | 948396 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 998818 | |
| 3 | 788063 | |
| 2 | 616041 | |
| 5 | 525246 | |
| 7 | 524886 | |
| 4 | 501449 | |
| 1 | 480642 | |
| 6 | 474819 | |
| 8 | 472165 | |
| 9 | 377071 | 6.1% |
| Other values (3) | 469267 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5077658 | |
| Other Punctuation | 998818 | 16.0% |
| Dash Punctuation | 151990 | 2.4% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 788063 | |
| 2 | 616041 | |
| 5 | 525246 | |
| 7 | 524886 | |
| 4 | 501449 | |
| 1 | 480642 | |
| 6 | 474819 | |
| 8 | 472165 | |
| 9 | 377071 | |
| 0 | 317276 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 998818 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 151990 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6228466 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 998818 | |
| 3 | 788063 | |
| 2 | 616041 | |
| 5 | 525246 | |
| 7 | 524886 | |
| 4 | 501449 | |
| 1 | 480642 | |
| 6 | 474819 | |
| 8 | 472165 | |
| 9 | 377071 | 6.1% |
| Other values (2) | 469266 |
Latin
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6228467 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 998818 | |
| 3 | 788063 | |
| 2 | 616041 | |
| 5 | 525246 | |
| 7 | 524886 | |
| 4 | 501449 | |
| 1 | 480642 | |
| 6 | 474819 | |
| 8 | 472165 | |
| 9 | 377071 | 6.1% |
| Other values (3) | 469267 |
decimalLongitude
Text
Missing 
| Distinct | 74667 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 927246 |
| Missing (%) | 48.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 7.110719202 |
| Min length | 3 |
Unique
| Unique | 27319 ? |
|---|---|
| Unique (%) | 2.7% |
Sample
| 1st row | -73.405 |
|---|---|
| 2nd row | -83.6297 |
| 3rd row | -54.742 |
| 4th row | -66.7708 |
| 5th row | 121.586 |
| Value | Count | Frequency (%) |
| 80.1 | 10527 | 1.1% |
| 127.848 | 4531 | 0.5% |
| 67.7683 | 4212 | 0.4% |
| 80.13 | 3737 | 0.4% |
| 82.7 | 3517 | 0.4% |
| 67.77 | 2820 | 0.3% |
| 66.775 | 2591 | 0.3% |
| 81.6633 | 2462 | 0.2% |
| 70.6731 | 2397 | 0.2% |
| 67.755 | 2355 | 0.2% |
| Other values (69814) | 959666 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 998815 | |
| - | 826053 | |
| 7 | 744517 | |
| 8 | 682555 | |
| 1 | 674556 | |
| 6 | 575271 | |
| 3 | 562177 | |
| 2 | 472510 | |
| 5 | 433016 | |
| 9 | 409778 | |
| Other values (2) | 723045 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5277425 | |
| Other Punctuation | 998815 | 14.1% |
| Dash Punctuation | 826053 | 11.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 744517 | |
| 8 | 682555 | |
| 1 | 674556 | |
| 6 | 575271 | |
| 3 | 562177 | |
| 2 | 472510 | |
| 5 | 433016 | |
| 9 | 409778 | |
| 0 | 371185 | |
| 4 | 351860 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 998815 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 826053 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7102293 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 998815 | |
| - | 826053 | |
| 7 | 744517 | |
| 8 | 682555 | |
| 1 | 674556 | |
| 6 | 575271 | |
| 3 | 562177 | |
| 2 | 472510 | |
| 5 | 433016 | |
| 9 | 409778 | |
| Other values (2) | 723045 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7102293 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 998815 | |
| - | 826053 | |
| 7 | 744517 | |
| 8 | 682555 | |
| 1 | 674556 | |
| 6 | 575271 | |
| 3 | 562177 | |
| 2 | 472510 | |
| 5 | 433016 | |
| 9 | 409778 | |
| Other values (2) | 723045 |
geodeticDatum
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1858158 |
| Missing (%) | 96.5% |
| Memory size | 14.7 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 5 |
| Mean length | 5.17221625 |
| Min length | 5 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | WGS84 |
|---|---|
| 2nd row | WGS84 |
| 3rd row | WGS84 |
| 4th row | WGS84 |
| 5th row | WGS84 |
| Value | Count | Frequency (%) |
| wgs84 | 67002 | |
| wgs | 896 | 1.3% |
| 84 | 896 | 1.3% |
| epsg:4326 | 896 | 1.3% |
| nad83 | 3 | < 0.1% |
| epsg:4269 | 3 | < 0.1% |
| 1936-08-14 | 1 | < 0.1% |
| 1926-08-24 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 68799 | |
| G | 68797 | |
| S | 68797 | |
| 8 | 67903 | |
| W | 67898 | |
| 1795 | 0.5% | |
| 6 | 901 | 0.3% |
| 2 | 901 | 0.3% |
| 3 | 900 | 0.3% |
| : | 899 | 0.3% |
| Other values (11) | 3619 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 207299 | |
| Decimal Number | 139414 | |
| Space Separator | 1795 | 0.5% |
| Other Punctuation | 899 | 0.3% |
| Open Punctuation | 899 | 0.3% |
| Close Punctuation | 899 | 0.3% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 68799 | |
| 8 | 67903 | |
| 6 | 901 | 0.6% |
| 2 | 901 | 0.6% |
| 3 | 900 | 0.6% |
| 9 | 5 | < 0.1% |
| 1 | 3 | < 0.1% |
| 0 | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 68797 | |
| S | 68797 | |
| W | 67898 | |
| P | 899 | 0.4% |
| E | 899 | 0.4% |
| N | 3 | < 0.1% |
| A | 3 | < 0.1% |
| D | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1795 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 899 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 899 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 899 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 207299 | |
| Common | 143910 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 68799 | |
| 8 | 67903 | |
| 1795 | 1.2% | |
| 6 | 901 | 0.6% |
| 2 | 901 | 0.6% |
| 3 | 900 | 0.6% |
| : | 899 | 0.6% |
| ( | 899 | 0.6% |
| ) | 899 | 0.6% |
| 9 | 5 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| G | 68797 | |
| S | 68797 | |
| W | 67898 | |
| P | 899 | 0.4% |
| E | 899 | 0.4% |
| N | 3 | < 0.1% |
| A | 3 | < 0.1% |
| D | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 351209 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 68799 | |
| G | 68797 | |
| S | 68797 | |
| 8 | 67903 | |
| W | 67898 | |
| 1795 | 0.5% | |
| 6 | 901 | 0.3% |
| 2 | 901 | 0.3% |
| 3 | 900 | 0.3% |
| : | 899 | 0.3% |
| Other values (11) | 3619 | 1.0% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 227 |
|---|---|
| 2nd row | 236 |
| Value | Count | Frequency (%) |
| 227 | 1 | |
| 236 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 227 |
|---|---|
| 2nd row | 236 |
| Value | Count | Frequency (%) |
| 227 | 1 | |
| 236 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 7 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 6 | 1 | 16.7% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1936 |
|---|---|
| 2nd row | 1926 |
| Value | Count | Frequency (%) |
| 1936 | 1 | |
| 1926 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 9 | 2 | |
| 6 | 2 | |
| 3 | 1 | |
| 2 | 1 |
verbatimLatitude
Text
Missing 
| Distinct | 13526 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 1854408 |
| Missing (%) | 96.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 81176 |
|---|---|
| Median length | 33373 |
| Mean length | 13.09566941 |
| Min length | 1 |
Unique
| Unique | 6600 ? |
|---|---|
| Unique (%) | 9.2% |
Sample
| 1st row | 12.083197 |
|---|---|
| 2nd row | 35 00.11 N |
| 3rd row | 21.502905 |
| 4th row | 29 47.5 N |
| 5th row | 36.4512 |
| Value | Count | Frequency (%) |
| n | 42003 | 19.9% |
| 29 | 8560 | 4.0% |
| s | 7189 | 3.4% |
| 28 | 6242 | 3.0% |
| 27 | 6074 | 2.9% |
| 00 | 3976 | 1.9% |
| 26 | 2567 | 1.2% |
| 24 | 2186 | 1.0% |
| 23 | 1948 | 0.9% |
| 42 | 1918 | 0.9% |
| Other values (13451) | 128767 |
Most occurring characters
| Value | Count | Frequency (%) |
| 128576 | 13.7% | |
| 2 | 75402 | 8.0% |
| 62317 | 6.6% | |
| 0 | 55150 | 5.9% |
| 3 | 51265 | 5.5% |
| N | 50953 | 5.4% |
| 4 | 50156 | 5.3% |
| . | 47966 | 5.1% |
| 1 | 45741 | 4.9% |
| 5 | 44030 | 4.7% |
| Other values (94) | 326788 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 467432 | |
| Space Separator | 128576 | 13.7% |
| Lowercase Letter | 118832 | 12.7% |
| Uppercase Letter | 78978 | 8.4% |
| Other Punctuation | 65558 | 7.0% |
| Control | 62650 | 6.7% |
| Dash Punctuation | 9429 | 1.0% |
| Other Symbol | 5148 | 0.5% |
| Other Letter | 477 | 0.1% |
| Close Punctuation | 372 | < 0.1% |
| Other values (6) | 892 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14870 | |
| e | 11139 | |
| i | 10564 | 8.9% |
| t | 10203 | 8.6% |
| o | 8641 | 7.3% |
| n | 8286 | 7.0% |
| d | 7287 | 6.1% |
| c | 7250 | 6.1% |
| l | 6889 | 5.8% |
| r | 6805 | 5.7% |
| Other values (20) | 26898 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 50953 | |
| S | 10837 | 13.7% |
| M | 2290 | 2.9% |
| A | 2082 | 2.6% |
| L | 1490 | 1.9% |
| U | 1211 | 1.5% |
| P | 1190 | 1.5% |
| O | 973 | 1.2% |
| D | 972 | 1.2% |
| E | 948 | 1.2% |
| Other values (16) | 6032 | 7.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 47966 | |
| ' | 4497 | 6.9% |
| : | 3498 | 5.3% |
| ; | 2874 | 4.4% |
| , | 2694 | 4.1% |
| / | 2078 | 3.2% |
| " | 1092 | 1.7% |
| ′ | 398 | 0.6% |
| * | 149 | 0.2% |
| ? | 125 | 0.2% |
| Other values (5) | 187 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 75402 | |
| 0 | 55150 | |
| 3 | 51265 | |
| 4 | 50156 | |
| 1 | 45741 | |
| 5 | 44030 | |
| 9 | 39569 | |
| 7 | 37449 | |
| 8 | 36950 | |
| 6 | 31720 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 361 | |
| } | 10 | 2.7% |
| ] | 1 | 0.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 361 | |
| { | 10 | 2.7% |
| [ | 1 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 57 | |
| ~ | 13 | 18.3% |
| + | 1 | 1.4% |
Control
| Value | Count | Frequency (%) |
| 62317 | ||
| 333 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9426 | |
| – | 3 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5146 | |
| ◦ | 2 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 82 | |
| ˚ | 14 | 14.6% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 26 | |
| ʺ | 1 | 3.7% |
Space Separator
| Value | Count | Frequency (%) |
| 128576 |
Other Letter
| Value | Count | Frequency (%) |
| º | 477 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 227 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 99 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 740057 | |
| Latin | 198287 | 21.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 50953 | |
| a | 14870 | 7.5% |
| e | 11139 | 5.6% |
| S | 10837 | 5.5% |
| i | 10564 | 5.3% |
| t | 10203 | 5.1% |
| o | 8641 | 4.4% |
| n | 8286 | 4.2% |
| d | 7287 | 3.7% |
| c | 7250 | 3.7% |
| Other values (47) | 58257 |
Common
| Value | Count | Frequency (%) |
| 128576 | ||
| 2 | 75402 | |
| 62317 | ||
| 0 | 55150 | 7.5% |
| 3 | 51265 | 6.9% |
| 4 | 50156 | 6.8% |
| . | 47966 | 6.5% |
| 1 | 45741 | 6.2% |
| 5 | 44030 | 5.9% |
| 9 | 39569 | 5.3% |
| Other values (37) | 139885 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 931984 | |
| None | 5715 | 0.6% |
| Punctuation | 602 | 0.1% |
| Modifier Letters | 41 | < 0.1% |
| Geometric Shapes | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 128576 | ||
| 2 | 75402 | 8.1% |
| 62317 | 6.7% | |
| 0 | 55150 | 5.9% |
| 3 | 51265 | 5.5% |
| N | 50953 | 5.5% |
| 4 | 50156 | 5.4% |
| . | 47966 | 5.1% |
| 1 | 45741 | 4.9% |
| 5 | 44030 | 4.7% |
| Other values (79) | 320428 |
None
| Value | Count | Frequency (%) |
| ° | 5146 | |
| º | 477 | 8.3% |
| ´ | 82 | 1.4% |
| ü | 4 | 0.1% |
| è | 3 | 0.1% |
| é | 2 | < 0.1% |
| ö | 1 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 398 | |
| ″ | 102 | 16.9% |
| ” | 99 | 16.4% |
| – | 3 | 0.5% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 26 | |
| ˚ | 14 | |
| ʺ | 1 | 2.4% |
Geometric Shapes
| Value | Count | Frequency (%) |
| ◦ | 2 |
Missing 
| Distinct | 13853 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 1854475 |
| Missing (%) | 96.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 40 |
| Mean length | 10.07856285 |
| Min length | 2 |
Unique
| Unique | 6900 ? |
|---|---|
| Unique (%) | 9.6% |
Sample
| 1st row | -68.899058 |
|---|---|
| 2nd row | 139 13.45 E |
| 3rd row | -157.801784 |
| 4th row | 85 54.5 W |
| 5th row | -121.1546 |
| Value | Count | Frequency (%) |
| w | 42754 | 22.5% |
| 84 | 7563 | 4.0% |
| e | 6274 | 3.3% |
| 00 | 3967 | 2.1% |
| 83 | 3204 | 1.7% |
| 86 | 2758 | 1.5% |
| 85 | 1732 | 0.9% |
| 53 | 1629 | 0.9% |
| 79 | 1576 | 0.8% |
| 17 | 1286 | 0.7% |
| Other values (8986) | 117328 |
Most occurring characters
| Value | Count | Frequency (%) |
| 118485 | ||
| 0 | 65196 | |
| 1 | 63938 | |
| 8 | 51373 | 7.1% |
| W | 49328 | 6.8% |
| . | 48187 | 6.7% |
| 5 | 47280 | 6.6% |
| 2 | 42177 | 5.8% |
| 3 | 42152 | 5.8% |
| 4 | 41303 | 5.7% |
| Other values (52) | 152065 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 456784 | |
| Space Separator | 118485 | 16.4% |
| Uppercase Letter | 59224 | 8.2% |
| Other Punctuation | 56936 | 7.9% |
| Dash Punctuation | 15722 | 2.2% |
| Lowercase Letter | 8267 | 1.1% |
| Other Symbol | 5138 | 0.7% |
| Other Letter | 470 | 0.1% |
| Connector Punctuation | 220 | < 0.1% |
| Final Punctuation | 107 | < 0.1% |
| Other values (4) | 131 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1217 | |
| o | 1165 | |
| e | 1093 | |
| d | 946 | |
| g | 936 | |
| t | 908 | |
| i | 901 | |
| u | 897 | |
| a | 69 | 0.8% |
| r | 47 | 0.6% |
| Other values (7) | 88 | 1.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 65196 | |
| 1 | 63938 | |
| 8 | 51373 | |
| 5 | 47280 | |
| 2 | 42177 | |
| 3 | 42152 | |
| 4 | 41303 | |
| 7 | 37164 | |
| 6 | 34447 | |
| 9 | 31754 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 49328 | |
| E | 7724 | 13.0% |
| L | 1158 | 2.0% |
| D | 440 | 0.7% |
| S | 222 | 0.4% |
| G | 220 | 0.4% |
| N | 75 | 0.1% |
| M | 54 | 0.1% |
| A | 2 | < 0.1% |
| R | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 48187 | |
| ' | 4398 | 7.7% |
| ; | 2607 | 4.6% |
| " | 984 | 1.7% |
| ′ | 398 | 0.7% |
| * | 149 | 0.3% |
| ″ | 102 | 0.2% |
| ? | 72 | 0.1% |
| , | 25 | < 0.1% |
| / | 14 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15719 | |
| – | 3 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 5136 | |
| ◦ | 2 | < 0.1% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 82 | |
| ˚ | 14 | 14.6% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 25 | |
| ʺ | 1 | 3.8% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 5 | |
| = | 2 | 28.6% |
Space Separator
| Value | Count | Frequency (%) |
| 118485 |
Other Letter
| Value | Count | Frequency (%) |
| º | 470 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 220 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 107 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 653523 | |
| Latin | 67961 | 9.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 118485 | ||
| 0 | 65196 | |
| 1 | 63938 | |
| 8 | 51373 | |
| . | 48187 | |
| 5 | 47280 | 7.2% |
| 2 | 42177 | 6.5% |
| 3 | 42152 | 6.4% |
| 4 | 41303 | 6.3% |
| 7 | 37164 | 5.7% |
| Other values (24) | 96268 |
Latin
| Value | Count | Frequency (%) |
| W | 49328 | |
| E | 7724 | 11.4% |
| n | 1217 | 1.8% |
| o | 1165 | 1.7% |
| L | 1158 | 1.7% |
| e | 1093 | 1.6% |
| d | 946 | 1.4% |
| g | 936 | 1.4% |
| t | 908 | 1.3% |
| i | 901 | 1.3% |
| Other values (18) | 2585 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 715144 | |
| None | 5688 | 0.8% |
| Punctuation | 610 | 0.1% |
| Modifier Letters | 40 | < 0.1% |
| Geometric Shapes | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 118485 | ||
| 0 | 65196 | |
| 1 | 63938 | |
| 8 | 51373 | 7.2% |
| W | 49328 | 6.9% |
| . | 48187 | 6.7% |
| 5 | 47280 | 6.6% |
| 2 | 42177 | 5.9% |
| 3 | 42152 | 5.9% |
| 4 | 41303 | 5.8% |
| Other values (41) | 145725 |
None
| Value | Count | Frequency (%) |
| ° | 5136 | |
| º | 470 | 8.3% |
| ´ | 82 | 1.4% |
Punctuation
| Value | Count | Frequency (%) |
| ′ | 398 | |
| ” | 107 | 17.5% |
| ″ | 102 | 16.7% |
| – | 3 | 0.5% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 25 | |
| ˚ | 14 | |
| ʺ | 1 | 2.5% |
Geometric Shapes
| Value | Count | Frequency (%) |
| ◦ | 2 |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1246668 |
| Missing (%) | 64.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.60570097 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 670787 | |
| minutes | 648088 | |
| seconds | 648088 | |
| decimal | 22699 | 1.1% |
| township | 7003 | 0.3% |
| range | 7003 | 0.3% |
| marsden | 604 | < 0.1% |
| square | 604 | < 0.1% |
| unknown | 532 | < 0.1% |
| utm | 464 | < 0.1% |
| Other values (3) | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3339448 | |
| s | 1974570 | |
| 1326485 | 8.6% | |
| n | 1312382 | 8.5% |
| g | 677790 | 4.4% |
| i | 677790 | 4.4% |
| r | 671998 | 4.4% |
| d | 671349 | 4.4% |
| D | 670832 | 4.4% |
| c | 670788 | 4.4% |
| Other values (20) | 3364723 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12047514 | |
| Uppercase Letter | 1984156 | 12.9% |
| Space Separator | 1326485 | 8.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3339448 | |
| s | 1974570 | |
| n | 1312382 | 10.9% |
| g | 677790 | 5.6% |
| i | 677790 | 5.6% |
| r | 671998 | 5.6% |
| d | 671349 | 5.6% |
| c | 670788 | 5.6% |
| o | 655625 | 5.4% |
| u | 648695 | 5.4% |
| Other values (9) | 747079 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 670832 | |
| M | 649156 | |
| S | 648692 | |
| T | 7467 | 0.4% |
| R | 7003 | 0.4% |
| U | 998 | 0.1% |
| Q | 3 | < 0.1% |
| A | 2 | < 0.1% |
| F | 2 | < 0.1% |
| G | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1326485 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14031670 | |
| Common | 1326485 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3339448 | |
| s | 1974570 | |
| n | 1312382 | 9.4% |
| g | 677790 | 4.8% |
| i | 677790 | 4.8% |
| r | 671998 | 4.8% |
| d | 671349 | 4.8% |
| D | 670832 | 4.8% |
| c | 670788 | 4.8% |
| o | 655625 | 4.7% |
| Other values (19) | 2709098 |
Common
| Value | Count | Frequency (%) |
| 1326485 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15358155 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3339448 | |
| s | 1974570 | |
| 1326485 | 8.6% | |
| n | 1312382 | 8.5% |
| g | 677790 | 4.4% |
| i | 677790 | 4.4% |
| r | 671998 | 4.4% |
| d | 671349 | 4.4% |
| D | 670832 | 4.4% |
| c | 670788 | 4.4% |
| Other values (20) | 3364723 |
footprintSRS
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Algeria |
|---|---|
| 2nd row | United States |
| Value | Count | Frequency (%) |
| algeria | 1 | |
| united | 1 | |
| states | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| a | 2 | |
| A | 1 | 5.0% |
| l | 1 | 5.0% |
| g | 1 | 5.0% |
| r | 1 | 5.0% |
| U | 1 | 5.0% |
| n | 1 | 5.0% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 3 | 15.0% |
| Space Separator | 1 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| a | 2 | |
| l | 1 | 6.2% |
| g | 1 | 6.2% |
| r | 1 | 6.2% |
| n | 1 | 6.2% |
| d | 1 | 6.2% |
| s | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1 | |
| U | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19 | |
| Common | 1 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| a | 2 | |
| A | 1 | 5.3% |
| l | 1 | 5.3% |
| g | 1 | 5.3% |
| r | 1 | 5.3% |
| U | 1 | 5.3% |
| n | 1 | 5.3% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| t | 3 | |
| i | 2 | |
| a | 2 | |
| A | 1 | 5.0% |
| l | 1 | 5.0% |
| g | 1 | 5.0% |
| r | 1 | 5.0% |
| U | 1 | 5.0% |
| n | 1 | 5.0% |
| Other values (4) | 4 |
georeferencedBy
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Idaho |
|---|
| Value | Count | Frequency (%) |
| idaho | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1 | |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4 | |
| Uppercase Letter | 1 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 1 | |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1 | |
| d | 1 | |
| a | 1 | |
| h | 1 | |
| o | 1 |
Missing 
| Distinct | 113 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1265567 |
| Missing (%) | 65.7% |
| Memory size | 14.7 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 20 |
| Mean length | 20.10035065 |
| Min length | 3 |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | unknown, from legacy |
|---|---|
| 2nd row | unknown, from legacy |
| 3rd row | unknown, from legacy |
| 4th row | unknown, from legacy |
| 5th row | unknown, from legacy |
| Value | Count | Frequency (%) |
| from | 508975 | |
| unknown | 507502 | |
| legacy | 505051 | |
| geolocate | 70300 | 3.6% |
| names | 41929 | 2.2% |
| geographic | 41548 | 2.1% |
| of | 35272 | 1.8% |
| getty | 34680 | 1.8% |
| thesaurus | 34679 | 1.8% |
| may | 23185 | 1.2% |
| Other values (125) | 141489 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1560576 | 11.8% |
| 1284116 | 9.7% | |
| o | 1253199 | 9.4% |
| e | 821923 | 6.2% |
| a | 796893 | 6.0% |
| r | 641911 | 4.8% |
| c | 624543 | 4.7% |
| g | 591212 | 4.5% |
| u | 580658 | 4.4% |
| y | 577336 | 4.3% |
| Other values (54) | 4543794 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10759532 | |
| Space Separator | 1284116 | 9.7% |
| Uppercase Letter | 560035 | 4.2% |
| Other Punctuation | 551518 | 4.2% |
| Decimal Number | 114440 | 0.9% |
| Dash Punctuation | 3268 | < 0.1% |
| Close Punctuation | 1624 | < 0.1% |
| Open Punctuation | 1624 | < 0.1% |
| Connector Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1560576 | |
| o | 1253199 | |
| e | 821923 | 7.6% |
| a | 796893 | 7.4% |
| r | 641911 | 6.0% |
| c | 624543 | 5.8% |
| g | 591212 | 5.5% |
| u | 580658 | 5.4% |
| y | 577336 | 5.4% |
| m | 572844 | 5.3% |
| Other values (14) | 2738437 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 185792 | |
| L | 76688 | |
| E | 75159 | |
| O | 56827 | 10.1% |
| N | 43892 | 7.8% |
| T | 36730 | 6.6% |
| M | 26360 | 4.7% |
| S | 23936 | 4.3% |
| U | 8296 | 1.5% |
| I | 8275 | 1.5% |
| Other values (9) | 18080 | 3.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 52662 | |
| 2 | 49519 | |
| 9 | 5897 | 5.2% |
| 4 | 2925 | 2.6% |
| 1 | 1974 | 1.7% |
| 5 | 1442 | 1.3% |
| 8 | 15 | < 0.1% |
| 7 | 4 | < 0.1% |
| 3 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 528902 | |
| / | 9410 | 1.7% |
| . | 9109 | 1.7% |
| : | 3482 | 0.6% |
| & | 594 | 0.1% |
| ! | 18 | < 0.1% |
| ' | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1284116 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3268 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1624 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1624 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11319567 | |
| Common | 1956594 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1560576 | |
| o | 1253199 | 11.1% |
| e | 821923 | 7.3% |
| a | 796893 | 7.0% |
| r | 641911 | 5.7% |
| c | 624543 | 5.5% |
| g | 591212 | 5.2% |
| u | 580658 | 5.1% |
| y | 577336 | 5.1% |
| m | 572844 | 5.1% |
| Other values (33) | 3298472 |
Common
| Value | Count | Frequency (%) |
| 1284116 | ||
| , | 528902 | |
| 0 | 52662 | 2.7% |
| 2 | 49519 | 2.5% |
| / | 9410 | 0.5% |
| . | 9109 | 0.5% |
| 9 | 5897 | 0.3% |
| : | 3482 | 0.2% |
| - | 3268 | 0.2% |
| 4 | 2925 | 0.1% |
| Other values (11) | 7304 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13276161 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1560576 | 11.8% |
| 1284116 | 9.7% | |
| o | 1253199 | 9.4% |
| e | 821923 | 6.2% |
| a | 796893 | 6.0% |
| r | 641911 | 4.8% |
| c | 624543 | 4.7% |
| g | 591212 | 4.5% |
| u | 580658 | 4.4% |
| y | 577336 | 4.3% |
| Other values (54) | 4543794 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926058 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 13 |
| Mean length | 15.66666667 |
| Min length | 8 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Boutaara |
|---|---|
| 2nd row | Beveridge, I. |
| 3rd row | Denton, J. F.; Byrd, E. E. |
| Value | Count | Frequency (%) |
| e | 2 | |
| boutaara | 1 | |
| beveridge | 1 | |
| i | 1 | |
| denton | 1 | |
| j | 1 | |
| f | 1 | |
| byrd | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | ||
| . | 5 | 10.6% |
| e | 4 | 8.5% |
| B | 3 | 6.4% |
| r | 3 | 6.4% |
| , | 3 | 6.4% |
| a | 3 | 6.4% |
| d | 2 | 4.3% |
| o | 2 | 4.3% |
| t | 2 | 4.3% |
| Other values (12) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23 | |
| Other Punctuation | 9 | 19.1% |
| Uppercase Letter | 9 | 19.1% |
| Space Separator | 6 | 12.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4 | |
| r | 3 | |
| a | 3 | |
| d | 2 | |
| o | 2 | |
| t | 2 | |
| n | 2 | |
| v | 1 | 4.3% |
| i | 1 | 4.3% |
| g | 1 | 4.3% |
| Other values (2) | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 3 | |
| E | 2 | |
| I | 1 | 11.1% |
| D | 1 | 11.1% |
| J | 1 | 11.1% |
| F | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 | |
| , | 3 | |
| ; | 1 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32 | |
| Common | 15 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4 | |
| B | 3 | 9.4% |
| r | 3 | 9.4% |
| a | 3 | 9.4% |
| d | 2 | 6.2% |
| o | 2 | 6.2% |
| t | 2 | 6.2% |
| n | 2 | 6.2% |
| E | 2 | 6.2% |
| v | 1 | 3.1% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| 6 | ||
| . | 5 | |
| , | 3 | |
| ; | 1 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | ||
| . | 5 | 10.6% |
| e | 4 | 8.5% |
| B | 3 | 6.4% |
| r | 3 | 6.4% |
| , | 3 | 6.4% |
| a | 3 | 6.4% |
| d | 2 | 4.3% |
| o | 2 | 4.3% |
| t | 2 | 4.3% |
| Other values (12) | 14 |
Missing 
| Distinct | 4818 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 1895791 |
| Missing (%) | 98.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 122 |
|---|---|
| Median length | 118 |
| Mean length | 23.035778 |
| Min length | 1 |
Unique
| Unique | 3162 ? |
|---|---|
| Unique (%) | 10.4% |
Sample
| 1st row | Extended About 16 Km Offshore From Crystal River Power Plant |
|---|---|
| 2nd row | 0.8 mile west of Montgomery-Polk county line, north side of |
| 3rd row | San Andreas Fault |
| 4th row | 6 Mile W Of Watsonville |
| 5th row | from Holt data card |
| Value | Count | Frequency (%) |
| approximate | 9788 | 9.0% |
| from | 6466 | 5.9% |
| river | 3462 | 3.2% |
| of | 3094 | 2.8% |
| about | 3075 | 2.8% |
| 16 | 2973 | 2.7% |
| km | 2969 | 2.7% |
| plant | 2932 | 2.7% |
| offshore | 2928 | 2.7% |
| power | 2928 | 2.7% |
| Other values (4966) | 68697 |
Most occurring characters
| Value | Count | Frequency (%) |
| 79042 | 11.3% | |
| a | 60472 | 8.7% |
| e | 55627 | 8.0% |
| o | 49156 | 7.0% |
| r | 47472 | 6.8% |
| t | 40214 | 5.8% |
| i | 29459 | 4.2% |
| n | 26669 | 3.8% |
| p | 24667 | 3.5% |
| m | 24216 | 3.5% |
| Other values (68) | 260299 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 519508 | |
| Space Separator | 79042 | 11.3% |
| Uppercase Letter | 71753 | 10.3% |
| Decimal Number | 14975 | 2.1% |
| Other Punctuation | 10482 | 1.5% |
| Close Punctuation | 574 | 0.1% |
| Open Punctuation | 570 | 0.1% |
| Dash Punctuation | 354 | 0.1% |
| Math Symbol | 35 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 60472 | |
| e | 55627 | |
| o | 49156 | 9.5% |
| r | 47472 | 9.1% |
| t | 40214 | 7.7% |
| i | 29459 | 5.7% |
| n | 26669 | 5.1% |
| p | 24667 | 4.7% |
| m | 24216 | 4.7% |
| l | 23875 | 4.6% |
| Other values (16) | 137681 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 8825 | |
| R | 7367 | |
| C | 6870 | 9.6% |
| O | 6475 | 9.0% |
| B | 4745 | 6.6% |
| A | 4362 | 6.1% |
| F | 4158 | 5.8% |
| E | 3946 | 5.5% |
| S | 3869 | 5.4% |
| K | 3631 | 5.1% |
| Other values (16) | 17505 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4138 | |
| 6 | 3402 | |
| 5 | 1659 | |
| 0 | 1440 | 9.6% |
| 3 | 1434 | 9.6% |
| 2 | 951 | 6.4% |
| 4 | 876 | 5.8% |
| 7 | 488 | 3.3% |
| 8 | 411 | 2.7% |
| 9 | 176 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 5315 | |
| . | 2055 | 19.6% |
| / | 1922 | 18.3% |
| : | 460 | 4.4% |
| ' | 363 | 3.5% |
| ; | 283 | 2.7% |
| " | 42 | 0.4% |
| & | 23 | 0.2% |
| # | 19 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 564 | |
| ] | 10 | 1.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 560 | |
| [ | 10 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 79042 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 354 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 35 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 591261 | |
| Common | 106032 | 15.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 60472 | 10.2% |
| e | 55627 | 9.4% |
| o | 49156 | 8.3% |
| r | 47472 | 8.0% |
| t | 40214 | 6.8% |
| i | 29459 | 5.0% |
| n | 26669 | 4.5% |
| p | 24667 | 4.2% |
| m | 24216 | 4.1% |
| l | 23875 | 4.0% |
| Other values (42) | 209434 |
Common
| Value | Count | Frequency (%) |
| 79042 | ||
| , | 5315 | 5.0% |
| 1 | 4138 | 3.9% |
| 6 | 3402 | 3.2% |
| . | 2055 | 1.9% |
| / | 1922 | 1.8% |
| 5 | 1659 | 1.6% |
| 0 | 1440 | 1.4% |
| 3 | 1434 | 1.4% |
| 2 | 951 | 0.9% |
| Other values (16) | 4674 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 697293 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 79042 | 11.3% | |
| a | 60472 | 8.7% |
| e | 55627 | 8.0% |
| o | 49156 | 7.0% |
| r | 47472 | 6.8% |
| t | 40214 | 5.8% |
| i | 29459 | 4.2% |
| n | 26669 | 3.8% |
| p | 24667 | 3.5% |
| m | 24216 | 3.5% |
| Other values (68) | 260299 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926058 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 75 |
|---|---|
| Median length | 37 |
| Mean length | 46.66666667 |
| Min length | 28 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North America, North Pacific Ocean, Departure Bay, Canada, British Columbia |
|---|---|
| 2nd row | North America, United States, Georgia |
| 3rd row | North America, United States |
| Value | Count | Frequency (%) |
| north | 4 | |
| america | 3 | |
| united | 2 | |
| states | 2 | |
| pacific | 1 | 5.3% |
| ocean | 1 | 5.3% |
| departure | 1 | 5.3% |
| bay | 1 | 5.3% |
| canada | 1 | 5.3% |
| british | 1 | 5.3% |
| Other values (2) | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| 16 | 11.4% | |
| a | 14 | 10.0% |
| t | 12 | 8.6% |
| r | 11 | 7.9% |
| e | 11 | 7.9% |
| i | 11 | 7.9% |
| , | 7 | 5.0% |
| o | 6 | 4.3% |
| c | 6 | 4.3% |
| h | 5 | 3.6% |
| Other values (21) | 41 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 98 | |
| Uppercase Letter | 19 | 13.6% |
| Space Separator | 16 | 11.4% |
| Other Punctuation | 7 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14 | |
| t | 12 | |
| r | 11 | |
| e | 11 | |
| i | 11 | |
| o | 6 | |
| c | 6 | |
| h | 5 | 5.1% |
| n | 4 | 4.1% |
| m | 4 | 4.1% |
| Other values (9) | 14 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4 | |
| A | 3 | |
| S | 2 | |
| U | 2 | |
| B | 2 | |
| C | 2 | |
| G | 1 | 5.3% |
| O | 1 | 5.3% |
| D | 1 | 5.3% |
| P | 1 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| 16 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 117 | |
| Common | 23 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 14 | |
| t | 12 | 10.3% |
| r | 11 | 9.4% |
| e | 11 | 9.4% |
| i | 11 | 9.4% |
| o | 6 | 5.1% |
| c | 6 | 5.1% |
| h | 5 | 4.3% |
| n | 4 | 3.4% |
| N | 4 | 3.4% |
| Other values (19) | 33 |
Common
| Value | Count | Frequency (%) |
| 16 | ||
| , | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 140 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 16 | 11.4% | |
| a | 14 | 10.0% |
| t | 12 | 8.6% |
| r | 11 | 7.9% |
| e | 11 | 7.9% |
| i | 11 | 7.9% |
| , | 7 | 5.0% |
| o | 6 | 4.3% |
| c | 6 | 4.3% |
| h | 5 | 3.6% |
| Other values (21) | 41 |
earliestEonOrLowestEonothem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 1926058 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 13 |
| Mean length | 20 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | North America, North Pacific Ocean |
|---|---|
| 2nd row | North America |
| 3rd row | North America |
| Value | Count | Frequency (%) |
| north | 4 | |
| america | 3 | |
| pacific | 1 | 11.1% |
| ocean | 1 | 11.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 7 | |
| c | 6 | |
| 6 | ||
| a | 5 | |
| i | 5 | |
| N | 4 | 6.7% |
| o | 4 | 6.7% |
| e | 4 | 6.7% |
| h | 4 | 6.7% |
| t | 4 | 6.7% |
| Other values (7) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44 | |
| Uppercase Letter | 9 | 15.0% |
| Space Separator | 6 | 10.0% |
| Other Punctuation | 1 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 7 | |
| c | 6 | |
| a | 5 | |
| i | 5 | |
| o | 4 | |
| e | 4 | |
| h | 4 | |
| t | 4 | |
| m | 3 | |
| f | 1 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4 | |
| A | 3 | |
| P | 1 | 11.1% |
| O | 1 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53 | |
| Common | 7 | 11.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 7 | |
| c | 6 | |
| a | 5 | |
| i | 5 | |
| N | 4 | |
| o | 4 | |
| e | 4 | |
| h | 4 | |
| t | 4 | |
| m | 3 | |
| Other values (5) | 7 |
Common
| Value | Count | Frequency (%) |
| 6 | ||
| , | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 60 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 7 | |
| c | 6 | |
| 6 | ||
| a | 5 | |
| i | 5 | |
| N | 4 | 6.7% |
| o | 4 | 6.7% |
| e | 4 | 6.7% |
| h | 4 | 6.7% |
| t | 4 | 6.7% |
| Other values (7) | 11 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 20.5 |
| Mean length | 20.5 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | North Pacific Ocean, Departure Bay |
|---|---|
| 2nd row | AL-1419 |
| Value | Count | Frequency (%) |
| north | 1 | |
| pacific | 1 | |
| ocean | 1 | |
| departure | 1 | |
| bay | 1 | |
| al-1419 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 9.8% | |
| a | 4 | 9.8% |
| r | 3 | 7.3% |
| c | 3 | 7.3% |
| e | 3 | 7.3% |
| t | 2 | 4.9% |
| 1 | 2 | 4.9% |
| i | 2 | 4.9% |
| N | 1 | 2.4% |
| u | 1 | 2.4% |
| Other values (16) | 16 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24 | |
| Uppercase Letter | 7 | 17.1% |
| Space Separator | 4 | 9.8% |
| Decimal Number | 4 | 9.8% |
| Dash Punctuation | 1 | 2.4% |
| Other Punctuation | 1 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| r | 3 | |
| c | 3 | |
| e | 3 | |
| t | 2 | |
| i | 2 | |
| u | 1 | 4.2% |
| y | 1 | 4.2% |
| n | 1 | 4.2% |
| p | 1 | 4.2% |
| Other values (3) | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 | |
| L | 1 | |
| A | 1 | |
| B | 1 | |
| D | 1 | |
| O | 1 | |
| P | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 1 | |
| 9 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 31 | |
| Common | 10 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | 12.9% |
| r | 3 | 9.7% |
| c | 3 | 9.7% |
| e | 3 | 9.7% |
| t | 2 | 6.5% |
| i | 2 | 6.5% |
| N | 1 | 3.2% |
| u | 1 | 3.2% |
| L | 1 | 3.2% |
| A | 1 | 3.2% |
| Other values (10) | 10 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| 1 | 2 | |
| 4 | 1 | 10.0% |
| - | 1 | 10.0% |
| , | 1 | 10.0% |
| 9 | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 9.8% | |
| a | 4 | 9.8% |
| r | 3 | 7.3% |
| c | 3 | 7.3% |
| e | 3 | 7.3% |
| t | 2 | 4.9% |
| 1 | 2 | 4.9% |
| i | 2 | 4.9% |
| N | 1 | 2.4% |
| u | 1 | 2.4% |
| Other values (16) | 16 |
earliestEraOrLowestErathem
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926052 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 8.666666667 |
| Min length | 4 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1911-09-29 |
|---|---|
| 2nd row | 1984-04-14 |
| 3rd row | 1997-04-24 |
| 4th row | 1962-06-19 |
| 5th row | 1935-06-26 |
| Value | Count | Frequency (%) |
| 1911-09-29 | 1 | |
| 1984-04-14 | 1 | |
| 1997-04-24 | 1 | |
| 1962-06-19 | 1 | |
| 1935-06-26 | 1 | |
| 1984-07-25 | 1 | |
| 1931 | 1 | |
| 1935-07-15 | 1 | |
| 1957 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 15 | |
| - | 14 | |
| 9 | 13 | |
| 0 | 7 | |
| 4 | 6 | 7.7% |
| 2 | 5 | 6.4% |
| 5 | 5 | 6.4% |
| 7 | 4 | 5.1% |
| 6 | 4 | 5.1% |
| 3 | 3 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64 | |
| Dash Punctuation | 14 | 17.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15 | |
| 9 | 13 | |
| 0 | 7 | |
| 4 | 6 | 9.4% |
| 2 | 5 | 7.8% |
| 5 | 5 | 7.8% |
| 7 | 4 | 6.2% |
| 6 | 4 | 6.2% |
| 3 | 3 | 4.7% |
| 8 | 2 | 3.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 78 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 15 | |
| - | 14 | |
| 9 | 13 | |
| 0 | 7 | |
| 4 | 6 | 7.7% |
| 2 | 5 | 6.4% |
| 5 | 5 | 6.4% |
| 7 | 4 | 5.1% |
| 6 | 4 | 5.1% |
| 3 | 3 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 15 | |
| - | 14 | |
| 9 | 13 | |
| 0 | 7 | |
| 4 | 6 | 7.7% |
| 2 | 5 | 6.4% |
| 5 | 5 | 6.4% |
| 7 | 4 | 5.1% |
| 6 | 4 | 5.1% |
| 3 | 3 | 3.8% |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 90.0% |
| Missing | 1926051 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 3 |
| Mean length | 5.3 |
| Min length | 3 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 80.0% |
Sample
| 1st row | 272 |
|---|---|
| 2nd row | 105 |
| 3rd row | Canada |
| 4th row | 114 |
| 5th row | 170 |
| Value | Count | Frequency (%) |
| united | 2 | |
| states | 2 | |
| 272 | 1 | |
| 105 | 1 | |
| canada | 1 | |
| 114 | 1 | |
| 170 | 1 | |
| 177 | 1 | |
| 207 | 1 | |
| 196 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6 | |
| 1 | 6 | |
| 7 | 5 | 9.4% |
| a | 5 | 9.4% |
| e | 4 | 7.5% |
| 2 | 3 | 5.7% |
| 0 | 3 | 5.7% |
| d | 3 | 5.7% |
| n | 3 | 5.7% |
| U | 2 | 3.8% |
| Other values (9) | 13 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25 | |
| Decimal Number | 21 | |
| Uppercase Letter | 5 | 9.4% |
| Space Separator | 2 | 3.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 7 | 5 | |
| 2 | 3 | |
| 0 | 3 | |
| 5 | 1 | 4.8% |
| 4 | 1 | 4.8% |
| 9 | 1 | 4.8% |
| 6 | 1 | 4.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 6 | |
| a | 5 | |
| e | 4 | |
| d | 3 | |
| n | 3 | |
| s | 2 | 8.0% |
| i | 2 | 8.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2 | |
| S | 2 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30 | |
| Common | 23 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 6 | |
| a | 5 | |
| e | 4 | |
| d | 3 | |
| n | 3 | |
| U | 2 | 6.7% |
| s | 2 | 6.7% |
| S | 2 | 6.7% |
| i | 2 | 6.7% |
| C | 1 | 3.3% |
Common
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 7 | 5 | |
| 2 | 3 | |
| 0 | 3 | |
| 2 | 8.7% | |
| 5 | 1 | 4.3% |
| 4 | 1 | 4.3% |
| 9 | 1 | 4.3% |
| 6 | 1 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 6 | |
| 1 | 6 | |
| 7 | 5 | 9.4% |
| a | 5 | 9.4% |
| e | 4 | 7.5% |
| 2 | 3 | 5.7% |
| 0 | 3 | 5.7% |
| d | 3 | 5.7% |
| n | 3 | 5.7% |
| U | 2 | 3.8% |
| Other values (9) | 13 |
latestPeriodOrHighestSystem
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926054 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 272 |
|---|---|
| 2nd row | 105 |
| 3rd row | 114 |
| 4th row | 170 |
| 5th row | 177 |
| Value | Count | Frequency (%) |
| 272 | 1 | |
| 105 | 1 | |
| 114 | 1 | |
| 170 | 1 | |
| 177 | 1 | |
| 207 | 1 | |
| 196 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 7 | 5 | |
| 2 | 3 | |
| 0 | 3 | |
| 5 | 1 | 4.8% |
| 4 | 1 | 4.8% |
| 9 | 1 | 4.8% |
| 6 | 1 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 21 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 7 | 5 | |
| 2 | 3 | |
| 0 | 3 | |
| 5 | 1 | 4.8% |
| 4 | 1 | 4.8% |
| 9 | 1 | 4.8% |
| 6 | 1 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 7 | 5 | |
| 2 | 3 | |
| 0 | 3 | |
| 5 | 1 | 4.8% |
| 4 | 1 | 4.8% |
| 9 | 1 | 4.8% |
| 6 | 1 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 6 | |
| 7 | 5 | |
| 2 | 3 | |
| 0 | 3 | |
| 5 | 1 | 4.8% |
| 4 | 1 | 4.8% |
| 9 | 1 | 4.8% |
| 6 | 1 | 4.8% |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 81.8% |
| Missing | 1926050 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 4 |
| Mean length | 5.363636364 |
| Min length | 4 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 63.6% |
Sample
| 1st row | 1911 |
|---|---|
| 2nd row | 1984 |
| 3rd row | British Columbia |
| 4th row | 1997 |
| 5th row | 1962 |
| Value | Count | Frequency (%) |
| 1984 | 2 | |
| 1935 | 2 | |
| 1911 | 1 | |
| british | 1 | |
| columbia | 1 | |
| 1997 | 1 | |
| 1962 | 1 | |
| georgia | 1 | |
| 1931 | 1 | |
| 1957 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 12 | |
| 9 | 10 | |
| i | 4 | 6.8% |
| 3 | 3 | 5.1% |
| 5 | 3 | 5.1% |
| 4 | 2 | 3.4% |
| r | 2 | 3.4% |
| 7 | 2 | 3.4% |
| 8 | 2 | 3.4% |
| o | 2 | 3.4% |
| Other values (16) | 17 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36 | |
| Lowercase Letter | 19 | |
| Uppercase Letter | 3 | 5.1% |
| Space Separator | 1 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4 | |
| r | 2 | |
| o | 2 | |
| a | 2 | |
| e | 1 | 5.3% |
| m | 1 | 5.3% |
| b | 1 | 5.3% |
| u | 1 | 5.3% |
| l | 1 | 5.3% |
| h | 1 | 5.3% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 12 | |
| 9 | 10 | |
| 3 | 3 | 8.3% |
| 5 | 3 | 8.3% |
| 4 | 2 | 5.6% |
| 7 | 2 | 5.6% |
| 8 | 2 | 5.6% |
| 2 | 1 | 2.8% |
| 6 | 1 | 2.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| C | 1 | |
| B | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37 | |
| Latin | 22 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4 | |
| r | 2 | 9.1% |
| o | 2 | 9.1% |
| a | 2 | 9.1% |
| e | 1 | 4.5% |
| m | 1 | 4.5% |
| G | 1 | 4.5% |
| b | 1 | 4.5% |
| C | 1 | 4.5% |
| u | 1 | 4.5% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 1 | 12 | |
| 9 | 10 | |
| 3 | 3 | 8.1% |
| 5 | 3 | 8.1% |
| 4 | 2 | 5.4% |
| 7 | 2 | 5.4% |
| 8 | 2 | 5.4% |
| 2 | 1 | 2.7% |
| 6 | 1 | 2.7% |
| 1 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 59 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 12 | |
| 9 | 10 | |
| i | 4 | 6.8% |
| 3 | 3 | 5.1% |
| 5 | 3 | 5.1% |
| 4 | 2 | 3.4% |
| r | 2 | 3.4% |
| 7 | 2 | 3.4% |
| 8 | 2 | 3.4% |
| o | 2 | 3.4% |
| Other values (16) | 17 |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 57.1% |
| Missing | 1926054 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 14.3% |
Sample
| 1st row | 9 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 6 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 9 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 9 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 9 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 9 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 6 | 2 | |
| 7 | 2 | |
| 9 | 1 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926054 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 29 |
|---|---|
| 2nd row | 14 |
| 3rd row | 24 |
| 4th row | 19 |
| 5th row | 26 |
| Value | Count | Frequency (%) |
| 29 | 1 | |
| 14 | 1 | |
| 24 | 1 | |
| 19 | 1 | |
| 26 | 1 | |
| 25 | 1 | |
| 15 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 1 | 3 | |
| 9 | 2 | |
| 4 | 2 | |
| 5 | 2 | |
| 6 | 1 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 1 | 3 | |
| 9 | 2 | |
| 4 | 2 | |
| 5 | 2 | |
| 6 | 1 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 1 | 3 | |
| 9 | 2 | |
| 4 | 2 | |
| 5 | 2 | |
| 6 | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 1 | 3 | |
| 9 | 2 | |
| 4 | 2 | |
| 5 | 2 | |
| 6 | 1 | 7.1% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Moultrie |
|---|
| Value | Count | Frequency (%) |
| moultrie | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7 | |
| Uppercase Letter | 1 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1 | |
| o | 1 | |
| u | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| i | 1 | |
| e | 1 |
highestBiostratigraphicZone
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 20.5 |
| Mean length | 20.5 |
| Min length | 19 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Hemionchos striatus |
|---|---|
| 2nd row | Conspicuum icteridorum |
| Value | Count | Frequency (%) |
| hemionchos | 1 | |
| striatus | 1 | |
| conspicuum | 1 | |
| icteridorum | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 4 | |
| u | 4 | |
| o | 4 | |
| m | 3 | 7.3% |
| c | 3 | 7.3% |
| t | 3 | 7.3% |
| r | 3 | 7.3% |
| n | 2 | 4.9% |
| e | 2 | 4.9% |
| Other values (7) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37 | |
| Space Separator | 2 | 4.9% |
| Uppercase Letter | 2 | 4.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 4 | |
| u | 4 | |
| o | 4 | |
| m | 3 | |
| c | 3 | |
| t | 3 | |
| r | 3 | |
| n | 2 | 5.4% |
| e | 2 | 5.4% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| H | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39 | |
| Common | 2 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 4 | |
| u | 4 | |
| o | 4 | |
| m | 3 | |
| c | 3 | |
| t | 3 | |
| r | 3 | |
| n | 2 | 5.1% |
| e | 2 | 5.1% |
| Other values (6) | 6 |
Common
| Value | Count | Frequency (%) |
| 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 5 | |
| s | 4 | |
| u | 4 | |
| o | 4 | |
| m | 3 | 7.3% |
| c | 3 | 7.3% |
| t | 3 | 7.3% |
| r | 3 | 7.3% |
| n | 2 | 4.9% |
| e | 2 | 4.9% |
| Other values (7) | 8 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 76 |
|---|---|
| Median length | 55 |
| Mean length | 55 |
| Min length | 34 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia, Platyhelminthes, Cestoda |
|---|---|
| 2nd row | Animalia, Platyhelminthes, Trematoda, Digenea, Plagiorchiida, Dicrocoeliidae |
| Value | Count | Frequency (%) |
| animalia | 2 | |
| platyhelminthes | 2 | |
| cestoda | 1 | |
| trematoda | 1 | |
| digenea | 1 | |
| plagiorchiida | 1 | |
| dicrocoeliidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 13 | |
| a | 13 | |
| e | 10 | 9.1% |
| l | 8 | 7.3% |
| , | 7 | 6.4% |
| 7 | 6.4% | |
| t | 6 | 5.5% |
| h | 5 | 4.5% |
| o | 5 | 4.5% |
| m | 5 | 4.5% |
| Other values (12) | 31 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 87 | |
| Uppercase Letter | 9 | 8.2% |
| Other Punctuation | 7 | 6.4% |
| Space Separator | 7 | 6.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 13 | |
| a | 13 | |
| e | 10 | |
| l | 8 | |
| t | 6 | |
| h | 5 | 5.7% |
| o | 5 | 5.7% |
| m | 5 | 5.7% |
| n | 5 | 5.7% |
| d | 4 | 4.6% |
| Other values (5) | 13 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3 | |
| D | 2 | |
| A | 2 | |
| C | 1 | 11.1% |
| T | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 96 | |
| Common | 14 | 12.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 13 | |
| a | 13 | |
| e | 10 | |
| l | 8 | 8.3% |
| t | 6 | 6.2% |
| h | 5 | 5.2% |
| o | 5 | 5.2% |
| m | 5 | 5.2% |
| n | 5 | 5.2% |
| d | 4 | 4.2% |
| Other values (10) | 22 |
Common
| Value | Count | Frequency (%) |
| , | 7 | |
| 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 110 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 13 | |
| a | 13 | |
| e | 10 | 9.1% |
| l | 8 | 7.3% |
| , | 7 | 6.4% |
| 7 | 6.4% | |
| t | 6 | 5.5% |
| h | 5 | 4.5% |
| o | 5 | 4.5% |
| m | 5 | 4.5% |
| Other values (12) | 31 |
Missing 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1907923 |
| Missing (%) | 99.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 3 |
| Mean length | 3.562189878 |
| Min length | 3 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | cf. |
|---|---|
| 2nd row | cf. |
| 3rd row | uncertain |
| 4th row | cf. |
| 5th row | cf. |
| Value | Count | Frequency (%) |
| cf | 15633 | |
| uncertain | 1489 | 8.2% |
| aff | 600 | 3.3% |
| near | 404 | 2.2% |
| america | 8 | < 0.1% |
| north | 4 | < 0.1% |
| south | 3 | < 0.1% |
| brazil | 2 | < 0.1% |
| united | 2 | < 0.1% |
| states | 2 | < 0.1% |
| Other values (19) | 21 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 17136 | |
| f | 16835 | |
| . | 16233 | |
| n | 3395 | 5.3% |
| a | 2527 | 3.9% |
| r | 1912 | 3.0% |
| e | 1911 | 3.0% |
| i | 1515 | 2.3% |
| t | 1509 | 2.3% |
| u | 1494 | 2.3% |
| Other values (23) | 144 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 48293 | |
| Other Punctuation | 16245 | 25.1% |
| Uppercase Letter | 43 | 0.1% |
| Space Separator | 30 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 17136 | |
| f | 16835 | |
| n | 3395 | 7.0% |
| a | 2527 | 5.2% |
| r | 1912 | 4.0% |
| e | 1911 | 4.0% |
| i | 1515 | 3.1% |
| t | 1509 | 3.1% |
| u | 1494 | 3.1% |
| o | 17 | < 0.1% |
| Other values (8) | 42 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 12 | |
| S | 7 | |
| U | 4 | 9.3% |
| N | 4 | 9.3% |
| C | 4 | 9.3% |
| P | 4 | 9.3% |
| B | 2 | 4.7% |
| R | 2 | 4.7% |
| D | 1 | 2.3% |
| L | 1 | 2.3% |
| Other values (2) | 2 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 16233 | |
| , | 12 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 30 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48336 | |
| Common | 16275 | 25.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 17136 | |
| f | 16835 | |
| n | 3395 | 7.0% |
| a | 2527 | 5.2% |
| r | 1912 | 4.0% |
| e | 1911 | 4.0% |
| i | 1515 | 3.1% |
| t | 1509 | 3.1% |
| u | 1494 | 3.1% |
| o | 17 | < 0.1% |
| Other values (20) | 85 | 0.2% |
Common
| Value | Count | Frequency (%) |
| . | 16233 | |
| 30 | 0.2% | |
| , | 12 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64611 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 17136 | |
| f | 16835 | |
| . | 16233 | |
| n | 3395 | 5.3% |
| a | 2527 | 3.9% |
| r | 1912 | 3.0% |
| e | 1911 | 3.0% |
| i | 1515 | 2.3% |
| t | 1509 | 2.3% |
| u | 1494 | 2.3% |
| Other values (23) | 144 | 0.2% |
typeStatus
Text
Missing 
| Distinct | 97 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1838230 |
| Missing (%) | 95.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 8 |
| Mean length | 7.998383259 |
| Min length | 4 |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paratype |
|---|---|
| 2nd row | Holotype |
| 3rd row | Paratype |
| 4th row | Holotype |
| 5th row | Paratype |
| Value | Count | Frequency (%) |
| paratype | 41423 | |
| holotype | 26115 | |
| syntype | 10062 | 11.1% |
| type | 5398 | 5.9% |
| allotype | 3095 | 3.4% |
| paralectotype | 1159 | 1.3% |
| 1105 | 1.2% | |
| lectotype | 1071 | 1.2% |
| neotype | 306 | 0.3% |
| unconfirmed | 292 | 0.3% |
| Other values (25) | 916 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 99691 | |
| e | 92427 | |
| p | 90036 | |
| t | 87236 | |
| a | 86598 | |
| o | 59520 | |
| P | 43202 | |
| r | 43155 | |
| l | 33479 | 4.8% |
| H | 26363 | 3.8% |
| Other values (20) | 40799 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 607854 | |
| Uppercase Letter | 89837 | 12.8% |
| Space Separator | 3111 | 0.4% |
| Math Symbol | 1105 | 0.2% |
| Other Punctuation | 599 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 99691 | |
| e | 92427 | |
| p | 90036 | |
| t | 87236 | |
| a | 86598 | |
| o | 59520 | |
| r | 43155 | |
| l | 33479 | 5.5% |
| n | 11250 | 1.9% |
| c | 2538 | 0.4% |
| Other values (7) | 1924 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 43202 | |
| H | 26363 | |
| S | 10065 | 11.2% |
| T | 5398 | 6.0% |
| A | 3103 | 3.5% |
| L | 1075 | 1.2% |
| N | 336 | 0.4% |
| U | 292 | 0.3% |
| C | 2 | < 0.1% |
| O | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3111 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1105 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 599 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 697691 | |
| Common | 4815 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 99691 | |
| e | 92427 | |
| p | 90036 | |
| t | 87236 | |
| a | 86598 | |
| o | 59520 | |
| P | 43202 | |
| r | 43155 | |
| l | 33479 | 4.8% |
| H | 26363 | 3.8% |
| Other values (17) | 35984 | 5.2% |
Common
| Value | Count | Frequency (%) |
| 3111 | ||
| + | 1105 | 22.9% |
| ; | 599 | 12.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 702506 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 99691 | |
| e | 92427 | |
| p | 90036 | |
| t | 87236 | |
| a | 86598 | |
| o | 59520 | |
| P | 43202 | |
| r | 43155 | |
| l | 33479 | 4.8% |
| H | 26363 | 3.8% |
| Other values (20) | 40799 |
identifiedBy
Text
Missing 
| Distinct | 13462 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 1085026 |
| Missing (%) | 56.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 226 |
|---|---|
| Median length | 133 |
| Mean length | 38.24104467 |
| Min length | 2 |
Unique
| Unique | 4203 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Opresko, Dennis M., Oak Ridge National Laboratory (UNITED STATES) |
|---|---|
| 2nd row | Nance |
| 3rd row | Mah, Christopher, (IZ), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 4th row | Verrill, Addison E., Peabody Museum, Yale |
| 5th row | Judkins, D. |
| Value | Count | Frequency (%) |
| of | 247151 | 5.3% |
| museum | 200612 | 4.3% |
| national | 197093 | 4.2% |
| institution | 188563 | 4.1% |
| smithsonian | 186033 | 4.0% |
| natural | 185749 | 4.0% |
| history | 185395 | 4.0% |
| united | 130387 | 2.8% |
| states | 129618 | 2.8% |
| 87179 | 1.9% | |
| Other values (9433) | 2903748 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3800493 | 11.8% | |
| a | 2080181 | 6.5% |
| i | 2055895 | 6.4% |
| t | 2012865 | 6.3% |
| n | 1895740 | 5.9% |
| o | 1744508 | 5.4% |
| e | 1499826 | 4.7% |
| r | 1384664 | 4.3% |
| s | 1382519 | 4.3% |
| , | 1349141 | 4.2% |
| Other values (84) | 12956225 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19463991 | |
| Uppercase Letter | 5956500 | 18.5% |
| Space Separator | 3800493 | 11.8% |
| Other Punctuation | 2376958 | 7.4% |
| Open Punctuation | 230274 | 0.7% |
| Close Punctuation | 230274 | 0.7% |
| Dash Punctuation | 97626 | 0.3% |
| Decimal Number | 5852 | < 0.1% |
| Math Symbol | 89 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2080181 | |
| i | 2055895 | |
| t | 2012865 | |
| n | 1895740 | |
| o | 1744508 | |
| e | 1499826 | |
| r | 1384664 | |
| s | 1382519 | |
| u | 1079640 | 5.5% |
| l | 969545 | 5.0% |
| Other values (37) | 3358608 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 646247 | 10.8% |
| N | 570315 | 9.6% |
| M | 471177 | 7.9% |
| I | 456181 | 7.7% |
| T | 454039 | 7.6% |
| H | 422870 | 7.1% |
| E | 378757 | 6.4% |
| A | 333690 | 5.6% |
| D | 272572 | 4.6% |
| C | 241386 | 4.1% |
| Other values (18) | 1709266 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1349141 | |
| . | 937156 | |
| ; | 64066 | 2.7% |
| / | 16438 | 0.7% |
| & | 5585 | 0.2% |
| ' | 4526 | 0.2% |
| " | 46 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2732 | |
| 1 | 2732 | |
| 2 | 148 | 2.5% |
| 0 | 92 | 1.6% |
| 6 | 74 | 1.3% |
| 9 | 74 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 97619 | |
| – | 7 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3800493 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 230274 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 230274 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 89 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25420491 | |
| Common | 6741566 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2080181 | 8.2% |
| i | 2055895 | 8.1% |
| t | 2012865 | 7.9% |
| n | 1895740 | 7.5% |
| o | 1744508 | 6.9% |
| e | 1499826 | 5.9% |
| r | 1384664 | 5.4% |
| s | 1382519 | 5.4% |
| u | 1079640 | 4.2% |
| l | 969545 | 3.8% |
| Other values (65) | 9315108 |
Common
| Value | Count | Frequency (%) |
| 3800493 | ||
| , | 1349141 | 20.0% |
| . | 937156 | 13.9% |
| ( | 230274 | 3.4% |
| ) | 230274 | 3.4% |
| - | 97619 | 1.4% |
| ; | 64066 | 1.0% |
| / | 16438 | 0.2% |
| & | 5585 | 0.1% |
| ' | 4526 | 0.1% |
| Other values (9) | 5994 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32156515 | |
| None | 5535 | < 0.1% |
| Punctuation | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3800493 | 11.8% | |
| a | 2080181 | 6.5% |
| i | 2055895 | 6.4% |
| t | 2012865 | 6.3% |
| n | 1895740 | 5.9% |
| o | 1744508 | 5.4% |
| e | 1499826 | 4.7% |
| r | 1384664 | 4.3% |
| s | 1382519 | 4.3% |
| , | 1349141 | 4.2% |
| Other values (60) | 12950683 |
None
| Value | Count | Frequency (%) |
| é | 1458 | |
| í | 1289 | |
| á | 848 | |
| ñ | 436 | 7.9% |
| ã | 401 | 7.2% |
| è | 285 | 5.1% |
| ö | 217 | 3.9% |
| ç | 159 | 2.9% |
| ó | 99 | 1.8% |
| ø | 98 | 1.8% |
| Other values (13) | 245 | 4.4% |
Punctuation
| Value | Count | Frequency (%) |
| – | 7 |
identifiedByID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 7 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 31.1435 |
|---|---|
| 2nd row | Plagiorchiida |
| Value | Count | Frequency (%) |
| 31.1435 | 1 | |
| plagiorchiida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3 | |
| 3 | 2 | 10.0% |
| 1 | 2 | 10.0% |
| a | 2 | 10.0% |
| . | 1 | 5.0% |
| 4 | 1 | 5.0% |
| 5 | 1 | 5.0% |
| P | 1 | 5.0% |
| l | 1 | 5.0% |
| g | 1 | 5.0% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Decimal Number | 6 | |
| Other Punctuation | 1 | 5.0% |
| Uppercase Letter | 1 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3 | |
| a | 2 | |
| l | 1 | 8.3% |
| g | 1 | 8.3% |
| o | 1 | 8.3% |
| r | 1 | 8.3% |
| c | 1 | 8.3% |
| h | 1 | 8.3% |
| d | 1 | 8.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 1 | 2 | |
| 4 | 1 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 | |
| Common | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3 | |
| a | 2 | |
| P | 1 | 7.7% |
| l | 1 | 7.7% |
| g | 1 | 7.7% |
| o | 1 | 7.7% |
| r | 1 | 7.7% |
| c | 1 | 7.7% |
| h | 1 | 7.7% |
| d | 1 | 7.7% |
Common
| Value | Count | Frequency (%) |
| 3 | 2 | |
| 1 | 2 | |
| . | 1 | |
| 4 | 1 | |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3 | |
| 3 | 2 | 10.0% |
| 1 | 2 | 10.0% |
| a | 2 | 10.0% |
| . | 1 | 5.0% |
| 4 | 1 | 5.0% |
| 5 | 1 | 5.0% |
| P | 1 | 5.0% |
| l | 1 | 5.0% |
| g | 1 | 5.0% |
| Other values (5) | 5 |
dateIdentified
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -83.7685 |
|---|
| Value | Count | Frequency (%) |
| 83.7685 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 2 | |
| - | 1 | |
| 3 | 1 | |
| . | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Dash Punctuation | 1 | 12.5% |
| Other Punctuation | 1 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 3 | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 2 | |
| - | 1 | |
| 3 | 1 | |
| . | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 2 | |
| - | 1 | |
| 3 | 1 | |
| . | 1 | |
| 7 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 77.8% |
| Missing | 1926052 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 9.777777778 |
| Min length | 6 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 55.6% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Brazil |
| 3rd row | Puerto Rico |
| 4th row | Argentina |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 2 | |
| states | 2 | |
| brazil | 2 | |
| puerto | 1 | |
| rico | 1 | |
| argentina | 1 | |
| costa | 1 | |
| rica | 1 | |
| dicrocoeliidae | 1 | |
| panama | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| t | 9 | 10.2% |
| e | 8 | 9.1% |
| o | 5 | 5.7% |
| r | 5 | 5.7% |
| n | 5 | 5.7% |
| 4 | 4.5% | |
| c | 4 | 4.5% |
| d | 3 | 3.4% |
| Other values (14) | 24 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 71 | |
| Uppercase Letter | 13 | 14.8% |
| Space Separator | 4 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| t | 9 | |
| e | 8 | |
| o | 5 | |
| r | 5 | |
| n | 5 | |
| c | 4 | 5.6% |
| d | 3 | 4.2% |
| s | 3 | 4.2% |
| Other values (5) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2 | |
| R | 2 | |
| P | 2 | |
| B | 2 | |
| S | 2 | |
| A | 1 | |
| C | 1 | |
| D | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 84 | |
| Common | 4 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| t | 9 | |
| e | 8 | 9.5% |
| o | 5 | 6.0% |
| r | 5 | 6.0% |
| n | 5 | 6.0% |
| c | 4 | 4.8% |
| d | 3 | 3.6% |
| s | 3 | 3.6% |
| Other values (13) | 21 |
Common
| Value | Count | Frequency (%) |
| 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| t | 9 | 10.2% |
| e | 8 | 9.1% |
| o | 5 | 5.7% |
| r | 5 | 5.7% |
| n | 5 | 5.7% |
| 4 | 4.5% | |
| c | 4 | 4.5% |
| d | 3 | 3.4% |
| Other values (14) | 24 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926056 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 10.8 |
| Min length | 8 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | District of Columbia |
|---|---|
| 2nd row | Amazonas |
| 3rd row | Louisiana |
| 4th row | Sao Paulo |
| 5th row | San Jose |
| Value | Count | Frequency (%) |
| district | 1 | |
| of | 1 | |
| columbia | 1 | |
| amazonas | 1 | |
| louisiana | 1 | |
| sao | 1 | |
| paulo | 1 | |
| san | 1 | |
| jose | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| o | 7 | |
| i | 5 | 9.3% |
| s | 4 | 7.4% |
| 4 | 7.4% | |
| u | 3 | 5.6% |
| n | 3 | 5.6% |
| t | 2 | 3.7% |
| S | 2 | 3.7% |
| l | 2 | 3.7% |
| Other values (13) | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42 | |
| Uppercase Letter | 8 | 14.8% |
| Space Separator | 4 | 7.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| o | 7 | |
| i | 5 | |
| s | 4 | |
| u | 3 | 7.1% |
| n | 3 | 7.1% |
| t | 2 | 4.8% |
| l | 2 | 4.8% |
| m | 2 | 4.8% |
| z | 1 | 2.4% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| J | 1 | |
| P | 1 | |
| L | 1 | |
| D | 1 | |
| A | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50 | |
| Common | 4 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| o | 7 | |
| i | 5 | |
| s | 4 | 8.0% |
| u | 3 | 6.0% |
| n | 3 | 6.0% |
| t | 2 | 4.0% |
| S | 2 | 4.0% |
| l | 2 | 4.0% |
| m | 2 | 4.0% |
| Other values (12) | 12 |
Common
| Value | Count | Frequency (%) |
| 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| o | 7 | |
| i | 5 | 9.3% |
| s | 4 | 7.4% |
| 4 | 7.4% | |
| u | 3 | 5.6% |
| n | 3 | 5.6% |
| t | 2 | 3.7% |
| S | 2 | 3.7% |
| l | 2 | 3.7% |
| Other values (13) | 14 |
scientificNameID
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Hemionchos |
|---|---|
| 2nd row | Conspicuum |
| Value | Count | Frequency (%) |
| hemionchos | 1 | |
| conspicuum | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| H | 1 | 5.0% |
| e | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18 | |
| Uppercase Letter | 2 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| e | 1 | 5.6% |
| h | 1 | 5.6% |
| p | 1 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1 | |
| C | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| H | 1 | 5.0% |
| e | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3 | |
| m | 2 | |
| i | 2 | |
| n | 2 | |
| c | 2 | |
| s | 2 | |
| u | 2 | |
| H | 1 | 5.0% |
| e | 1 | 5.0% |
| h | 1 | 5.0% |
| Other values (2) | 2 |
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926053 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 11.5 |
| Mean length | 19.5 |
| Min length | 4 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Washington DC |
|---|---|
| 2nd row | Manaus, Rio Solimoes, Ilha Da Marchantaria |
| 3rd row | Ponce |
| 4th row | Azul |
| 5th row | Raceland |
| Value | Count | Frequency (%) |
| washington | 1 | 4.2% |
| dc | 1 | 4.2% |
| carcoles | 1 | 4.2% |
| off | 1 | 4.2% |
| piracicaba | 1 | 4.2% |
| frane | 1 | 4.2% |
| camora | 1 | 4.2% |
| from | 1 | 4.2% |
| segment | 1 | 4.2% |
| endeavour | 1 | 4.2% |
| Other values (14) | 14 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 21 | 13.5% |
| 16 | 10.3% | |
| n | 11 | 7.1% |
| e | 10 | 6.4% |
| o | 10 | 6.4% |
| i | 8 | 5.1% |
| r | 8 | 5.1% |
| c | 7 | 4.5% |
| u | 5 | 3.2% |
| l | 5 | 3.2% |
| Other values (24) | 55 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 112 | |
| Uppercase Letter | 24 | 15.4% |
| Space Separator | 16 | 10.3% |
| Other Punctuation | 4 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 21 | |
| n | 11 | |
| e | 10 | |
| o | 10 | |
| i | 8 | 7.1% |
| r | 8 | 7.1% |
| c | 7 | 6.2% |
| u | 5 | 4.5% |
| l | 5 | 4.5% |
| t | 4 | 3.6% |
| Other values (9) | 23 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3 | |
| R | 3 | |
| P | 3 | |
| F | 3 | |
| S | 2 | |
| M | 2 | |
| D | 2 | |
| I | 1 | 4.2% |
| A | 1 | 4.2% |
| J | 1 | 4.2% |
| Other values (3) | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 16 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 136 | |
| Common | 20 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 21 | |
| n | 11 | 8.1% |
| e | 10 | 7.4% |
| o | 10 | 7.4% |
| i | 8 | 5.9% |
| r | 8 | 5.9% |
| c | 7 | 5.1% |
| u | 5 | 3.7% |
| l | 5 | 3.7% |
| t | 4 | 2.9% |
| Other values (22) | 47 |
Common
| Value | Count | Frequency (%) |
| 16 | ||
| , | 4 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 21 | 13.5% |
| 16 | 10.3% | |
| n | 11 | 7.1% |
| e | 10 | 6.4% |
| o | 10 | 6.4% |
| i | 8 | 5.1% |
| r | 8 | 5.1% |
| c | 7 | 4.5% |
| u | 5 | 3.2% |
| l | 5 | 3.2% |
| Other values (24) | 55 |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9.5 |
| Mean length | 9.5 |
| Min length | 8 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | striatus |
|---|---|
| 2nd row | icteridorum |
| Value | Count | Frequency (%) |
| striatus | 1 | |
| icteridorum | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3 | |
| r | 3 | |
| i | 3 | |
| s | 2 | |
| u | 2 | |
| a | 1 | 5.3% |
| c | 1 | 5.3% |
| e | 1 | 5.3% |
| d | 1 | 5.3% |
| o | 1 | 5.3% |
scientificName
Text
Missing 
| Distinct | 133983 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 353701 |
| Missing (%) | 18.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 85 |
|---|---|
| Median length | 59 |
| Mean length | 19.4468843 |
| Min length | 4 |
Unique
| Unique | 51616 ? |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | Scypha sp. |
|---|---|
| 2nd row | Bulla striata |
| 3rd row | Stylopathes columnaris |
| 4th row | Ophiothrix suensonii |
| 5th row | Cypraea labrolineata |
| Value | Count | Frequency (%) |
| sp | 198030 | 6.0% |
| conus | 24321 | 0.7% |
| cypraea | 15393 | 0.5% |
| cambarus | 12002 | 0.4% |
| cerithium | 9394 | 0.3% |
| orconectes | 8683 | 0.3% |
| procambarus | 8139 | 0.2% |
| nassarius | 6728 | 0.2% |
| gracilis | 6630 | 0.2% |
| terebra | 5167 | 0.2% |
| Other values (70823) | 3024717 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3609833 | 11.8% |
| i | 2749960 | 9.0% |
| s | 2277139 | 7.4% |
| e | 1954011 | 6.4% |
| r | 1901028 | 6.2% |
| o | 1840277 | 6.0% |
| 1746844 | 5.7% | |
| l | 1713984 | 5.6% |
| n | 1541454 | 5.0% |
| t | 1536972 | 5.0% |
| Other values (68) | 9706001 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26720492 | |
| Space Separator | 1746844 | 5.7% |
| Uppercase Letter | 1685116 | 5.5% |
| Other Punctuation | 198760 | 0.7% |
| Open Punctuation | 112845 | 0.4% |
| Close Punctuation | 112845 | 0.4% |
| Decimal Number | 473 | < 0.1% |
| Dash Punctuation | 110 | < 0.1% |
| Math Symbol | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3609833 | |
| i | 2749960 | |
| s | 2277139 | 8.5% |
| e | 1954011 | 7.3% |
| r | 1901028 | 7.1% |
| o | 1840277 | 6.9% |
| l | 1713984 | 6.4% |
| n | 1541454 | 5.8% |
| t | 1536972 | 5.8% |
| u | 1522029 | 5.7% |
| Other values (18) | 6073805 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 246678 | |
| P | 236724 | |
| A | 165911 | |
| S | 135225 | 8.0% |
| M | 109797 | 6.5% |
| T | 106733 | 6.3% |
| L | 97688 | 5.8% |
| E | 85762 | 5.1% |
| O | 78979 | 4.7% |
| N | 66249 | 3.9% |
| Other values (16) | 355370 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 156 | |
| 8 | 111 | |
| 4 | 58 | 12.3% |
| 9 | 38 | 8.0% |
| 6 | 27 | 5.7% |
| 2 | 27 | 5.7% |
| 5 | 19 | 4.0% |
| 7 | 16 | 3.4% |
| 0 | 14 | 3.0% |
| 3 | 7 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 198543 | |
| , | 107 | 0.1% |
| " | 60 | < 0.1% |
| / | 29 | < 0.1% |
| ' | 15 | < 0.1% |
| & | 3 | < 0.1% |
| ? | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 112844 | |
| [ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 112844 | |
| ] | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1746844 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 110 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 18 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28405608 | |
| Common | 2171895 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3609833 | |
| i | 2749960 | 9.7% |
| s | 2277139 | 8.0% |
| e | 1954011 | 6.9% |
| r | 1901028 | 6.7% |
| o | 1840277 | 6.5% |
| l | 1713984 | 6.0% |
| n | 1541454 | 5.4% |
| t | 1536972 | 5.4% |
| u | 1522029 | 5.4% |
| Other values (44) | 7758921 |
Common
| Value | Count | Frequency (%) |
| 1746844 | ||
| . | 198543 | 9.1% |
| ( | 112844 | 5.2% |
| ) | 112844 | 5.2% |
| 1 | 156 | < 0.1% |
| 8 | 111 | < 0.1% |
| - | 110 | < 0.1% |
| , | 107 | < 0.1% |
| " | 60 | < 0.1% |
| 4 | 58 | < 0.1% |
| Other values (14) | 218 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30577487 | |
| None | 16 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3609833 | 11.8% |
| i | 2749960 | 9.0% |
| s | 2277139 | 7.4% |
| e | 1954011 | 6.4% |
| r | 1901028 | 6.2% |
| o | 1840277 | 6.0% |
| 1746844 | 5.7% | |
| l | 1713984 | 5.6% |
| n | 1541454 | 5.0% |
| t | 1536972 | 5.0% |
| Other values (66) | 9705985 |
None
| Value | Count | Frequency (%) |
| ü | 15 | |
| æ | 1 | 6.2% |
parentNameUsage
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 16.5 |
| Mean length | 16.5 |
| Min length | 13 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Campbell & Beveridge |
|---|---|
| 2nd row | Denton & Byrd |
| Value | Count | Frequency (%) |
| 2 | ||
| campbell | 1 | |
| beveridge | 1 | |
| denton | 1 | |
| byrd | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5 | |
| 4 | 12.1% | |
| d | 2 | 6.1% |
| n | 2 | 6.1% |
| l | 2 | 6.1% |
| & | 2 | 6.1% |
| B | 2 | 6.1% |
| r | 2 | 6.1% |
| C | 1 | 3.0% |
| o | 1 | 3.0% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23 | |
| Space Separator | 4 | 12.1% |
| Uppercase Letter | 4 | 12.1% |
| Other Punctuation | 2 | 6.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5 | |
| d | 2 | 8.7% |
| n | 2 | 8.7% |
| l | 2 | 8.7% |
| r | 2 | 8.7% |
| o | 1 | 4.3% |
| t | 1 | 4.3% |
| g | 1 | 4.3% |
| v | 1 | 4.3% |
| i | 1 | 4.3% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| C | 1 | |
| D | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 | |
| Common | 6 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5 | |
| d | 2 | 7.4% |
| n | 2 | 7.4% |
| l | 2 | 7.4% |
| B | 2 | 7.4% |
| r | 2 | 7.4% |
| C | 1 | 3.7% |
| o | 1 | 3.7% |
| t | 1 | 3.7% |
| D | 1 | 3.7% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| & | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5 | |
| 4 | 12.1% | |
| d | 2 | 6.1% |
| n | 2 | 6.1% |
| l | 2 | 6.1% |
| & | 2 | 6.1% |
| B | 2 | 6.1% |
| r | 2 | 6.1% |
| C | 1 | 3.0% |
| o | 1 | 3.0% |
| Other values (10) | 10 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | GEOLocate |
|---|
| Value | Count | Frequency (%) |
| geolocate | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 | |
| Uppercase Letter | 4 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 1 | |
| E | 1 | |
| O | 1 | |
| L | 1 | |
| o | 1 | |
| c | 1 | |
| a | 1 | |
| t | 1 | |
| e | 1 |
| Distinct | 4360 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 474 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 134 |
|---|---|
| Median length | 117 |
| Mean length | 62.96713054 |
| Min length | 5 |
Unique
| Unique | 592 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Porifera, Calcarea |
|---|---|
| 2nd row | Animalia, Mollusca, Gastropoda, Bullidae |
| 3rd row | Animalia, Cnidaria, Anthozoa, Hexacorallia, Antipatharia, Stylopathidae |
| 4th row | Animalia, Echinodermata, Ophiuroidea, Ophiurida, Ophiotrichidae |
| 5th row | Animalia, Mollusca, Gastropoda, Cypraeidae |
| Value | Count | Frequency (%) |
| animalia | 1921701 | 18.1% |
| mollusca | 866254 | 8.1% |
| gastropoda | 612643 | 5.8% |
| arthropoda | 390685 | 3.7% |
| crustacea | 385047 | 3.6% |
| malacostraca | 301920 | 2.8% |
| eumalacostraca | 294842 | 2.8% |
| annelida | 241745 | 2.3% |
| polychaeta | 212926 | 2.0% |
| bivalvia | 207657 | 2.0% |
| Other values (4348) | 5201888 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 19357102 | |
| i | 10627524 | 8.8% |
| 8711721 | 7.2% | |
| , | 8690181 | 7.2% |
| o | 7922403 | 6.5% |
| l | 7524872 | 6.2% |
| e | 6161710 | 5.1% |
| d | 5674220 | 4.7% |
| r | 5611641 | 4.6% |
| c | 5022856 | 4.1% |
| Other values (60) | 35944458 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 93230361 | |
| Uppercase Letter | 10615593 | 8.8% |
| Space Separator | 8711721 | 7.2% |
| Other Punctuation | 8690229 | 7.2% |
| Dash Punctuation | 285 | < 0.1% |
| Open Punctuation | 169 | < 0.1% |
| Close Punctuation | 169 | < 0.1% |
| Connector Punctuation | 126 | < 0.1% |
| Decimal Number | 32 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 19357102 | |
| i | 10627524 | |
| o | 7922403 | |
| l | 7524872 | 8.1% |
| e | 6161710 | 6.6% |
| d | 5674220 | 6.1% |
| r | 5611641 | 6.0% |
| c | 5022856 | 5.4% |
| n | 4723054 | 5.1% |
| t | 4392846 | 4.7% |
| Other values (16) | 16212133 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2993156 | |
| M | 1365520 | |
| C | 1144907 | 10.8% |
| P | 1045845 | 9.9% |
| E | 845954 | 8.0% |
| G | 714563 | 6.7% |
| S | 488541 | 4.6% |
| D | 334992 | 3.2% |
| B | 296918 | 2.8% |
| T | 261542 | 2.5% |
| Other values (15) | 1123655 | 10.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 7 | |
| 2 | 5 | |
| 9 | 5 | |
| 5 | 3 | |
| 3 | 3 | |
| 8 | 3 | |
| 1 | 2 | 6.2% |
| 4 | 2 | 6.2% |
| 0 | 1 | 3.1% |
| 6 | 1 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8690181 | |
| . | 34 | < 0.1% |
| ? | 14 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 8711721 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 285 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 169 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 169 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 126 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 103845954 | |
| Common | 17402734 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 19357102 | |
| i | 10627524 | 10.2% |
| o | 7922403 | 7.6% |
| l | 7524872 | 7.2% |
| e | 6161710 | 5.9% |
| d | 5674220 | 5.5% |
| r | 5611641 | 5.4% |
| c | 5022856 | 4.8% |
| n | 4723054 | 4.5% |
| t | 4392846 | 4.2% |
| Other values (41) | 26827726 |
Common
| Value | Count | Frequency (%) |
| 8711721 | ||
| , | 8690181 | |
| - | 285 | < 0.1% |
| [ | 169 | < 0.1% |
| ] | 169 | < 0.1% |
| _ | 126 | < 0.1% |
| . | 34 | < 0.1% |
| ? | 14 | < 0.1% |
| 7 | 7 | < 0.1% |
| 2 | 5 | < 0.1% |
| Other values (9) | 23 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 121248688 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 19357102 | |
| i | 10627524 | 8.8% |
| 8711721 | 7.2% | |
| , | 8690181 | 7.2% |
| o | 7922403 | 6.5% |
| l | 7524872 | 6.2% |
| e | 6161710 | 5.1% |
| d | 5674220 | 4.7% |
| r | 5611641 | 4.6% |
| c | 5022856 | 4.1% |
| Other values (60) | 35944458 |
kingdom
Text
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2074 |
| Missing (%) | 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.00002079 |
| Min length | 7 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 1921701 | |
| protozoa | 2154 | 0.1% |
| protista | 55 | < 0.1% |
| chromista | 36 | < 0.1% |
| bacteria | 28 | < 0.1% |
| eukaryota | 6 | < 0.1% |
| eukarya | 1 | < 0.1% |
| 77.0364 | 1 | < 0.1% |
| 59.9317 | 1 | < 0.1% |
| 59.8585 | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3845717 | |
| i | 3843521 | |
| m | 1921737 | |
| A | 1921701 | |
| l | 1921701 | |
| n | 1921701 | |
| o | 6559 | < 0.1% |
| t | 2334 | < 0.1% |
| r | 2280 | < 0.1% |
| P | 2209 | < 0.1% |
| Other values (23) | 2476 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13467908 | |
| Uppercase Letter | 1923981 | 12.5% |
| Decimal Number | 35 | < 0.1% |
| Dash Punctuation | 6 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3845717 | |
| i | 3843521 | |
| m | 1921737 | |
| l | 1921701 | |
| n | 1921701 | |
| o | 6559 | < 0.1% |
| t | 2334 | < 0.1% |
| r | 2280 | < 0.1% |
| z | 2154 | < 0.1% |
| s | 91 | < 0.1% |
| Other values (6) | 113 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 8 | |
| 5 | 5 | |
| 0 | 5 | |
| 7 | 5 | |
| 8 | 3 | 8.6% |
| 6 | 2 | 5.7% |
| 4 | 2 | 5.7% |
| 3 | 2 | 5.7% |
| 1 | 2 | 5.7% |
| 2 | 1 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1921701 | |
| P | 2209 | 0.1% |
| C | 36 | < 0.1% |
| B | 28 | < 0.1% |
| E | 7 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15391889 | |
| Common | 47 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3845717 | |
| i | 3843521 | |
| m | 1921737 | |
| A | 1921701 | |
| l | 1921701 | |
| n | 1921701 | |
| o | 6559 | < 0.1% |
| t | 2334 | < 0.1% |
| r | 2280 | < 0.1% |
| P | 2209 | < 0.1% |
| Other values (11) | 2429 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 9 | 8 | |
| - | 6 | |
| . | 6 | |
| 5 | 5 | |
| 0 | 5 | |
| 7 | 5 | |
| 8 | 3 | 6.4% |
| 6 | 2 | 4.3% |
| 4 | 2 | 4.3% |
| 3 | 2 | 4.3% |
| Other values (2) | 3 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15391936 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3845717 | |
| i | 3843521 | |
| m | 1921737 | |
| A | 1921701 | |
| l | 1921701 | |
| n | 1921701 | |
| o | 6559 | < 0.1% |
| t | 2334 | < 0.1% |
| r | 2280 | < 0.1% |
| P | 2209 | < 0.1% |
| Other values (23) | 2476 | < 0.1% |
phylum
Text
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 525 |
| Missing (%) | < 0.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 8 |
| Mean length | 8.859807347 |
| Min length | 5 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Porifera |
|---|---|
| 2nd row | Mollusca |
| 3rd row | Cnidaria |
| 4th row | Echinodermata |
| 5th row | Mollusca |
| Value | Count | Frequency (%) |
| mollusca | 866254 | |
| arthropoda | 390685 | |
| annelida | 241588 | 12.5% |
| cnidaria | 117378 | 6.1% |
| echinodermata | 91192 | 4.7% |
| nematoda | 68776 | 3.6% |
| platyhelminthes | 46010 | 2.4% |
| porifera | 32720 | 1.7% |
| chordata | 19744 | 1.0% |
| sipuncula | 10414 | 0.5% |
| Other values (84) | 42151 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2241799 | |
| l | 2085545 | |
| o | 1908402 | |
| r | 1107910 | 6.5% |
| c | 989189 | 5.8% |
| d | 934105 | 5.5% |
| s | 915271 | 5.4% |
| u | 887926 | 5.2% |
| M | 867048 | 5.1% |
| n | 770924 | 4.5% |
| Other values (40) | 4351759 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15132549 | |
| Uppercase Letter | 1925540 | 11.3% |
| Space Separator | 1376 | < 0.1% |
| Dash Punctuation | 283 | < 0.1% |
| Connector Punctuation | 126 | < 0.1% |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2241799 | |
| l | 2085545 | |
| o | 1908402 | |
| r | 1107910 | |
| c | 989189 | 6.5% |
| d | 934105 | 6.2% |
| s | 915271 | 6.0% |
| u | 887926 | 5.9% |
| n | 770924 | 5.1% |
| t | 685639 | 4.5% |
| Other values (14) | 2605839 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 867048 | |
| A | 638618 | |
| C | 140083 | 7.3% |
| E | 91424 | 4.7% |
| P | 80493 | 4.2% |
| N | 75155 | 3.9% |
| S | 11217 | 0.6% |
| B | 10349 | 0.5% |
| K | 6388 | 0.3% |
| H | 2153 | 0.1% |
| Other values (11) | 2612 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 2 | |
| 4 | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 1376 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 283 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 126 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17058089 | |
| Common | 1789 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2241799 | |
| l | 2085545 | |
| o | 1908402 | |
| r | 1107910 | 6.5% |
| c | 989189 | 5.8% |
| d | 934105 | 5.5% |
| s | 915271 | 5.4% |
| u | 887926 | 5.2% |
| M | 867048 | 5.1% |
| n | 770924 | 4.5% |
| Other values (35) | 4349970 |
Common
| Value | Count | Frequency (%) |
| 1376 | ||
| - | 283 | 15.8% |
| _ | 126 | 7.0% |
| 8 | 2 | 0.1% |
| 4 | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17059878 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2241799 | |
| l | 2085545 | |
| o | 1908402 | |
| r | 1107910 | 6.5% |
| c | 989189 | 5.8% |
| d | 934105 | 5.5% |
| s | 915271 | 5.4% |
| u | 887926 | 5.2% |
| M | 867048 | 5.1% |
| n | 770924 | 4.5% |
| Other values (40) | 4351759 |
class
Text
Missing 
| Distinct | 140 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 76135 |
| Missing (%) | 4.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 19 |
| Mean length | 10.09882287 |
| Min length | 4 |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Calcarea |
|---|---|
| 2nd row | Gastropoda |
| 3rd row | Anthozoa |
| 4th row | Ophiuroidea |
| 5th row | Gastropoda |
| Value | Count | Frequency (%) |
| gastropoda | 612643 | |
| malacostraca | 301920 | |
| polychaeta | 210885 | 11.4% |
| bivalvia | 207657 | 11.2% |
| anthozoa | 93047 | 5.0% |
| maxillopoda | 54367 | 2.9% |
| chromadorea | 34765 | 1.9% |
| ophiuroidea | 27083 | 1.5% |
| asteroidea | 25627 | 1.4% |
| oligochaeta | 25284 | 1.4% |
| Other values (130) | 256648 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4039714 | |
| o | 2550884 | |
| t | 1353417 | 7.2% |
| r | 1171215 | 6.3% |
| s | 1019474 | 5.5% |
| c | 959574 | 5.1% |
| d | 951027 | 5.1% |
| l | 934032 | 5.0% |
| p | 820053 | 4.4% |
| i | 730905 | 3.9% |
| Other values (33) | 4151780 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16832149 | |
| Uppercase Letter | 1849926 | 9.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4039714 | |
| o | 2550884 | |
| t | 1353417 | 8.0% |
| r | 1171215 | 7.0% |
| s | 1019474 | 6.1% |
| c | 959574 | 5.7% |
| d | 951027 | 5.7% |
| l | 934032 | 5.5% |
| p | 820053 | 4.9% |
| i | 730905 | 4.3% |
| Other values (14) | 2301854 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 612731 | |
| M | 363412 | |
| P | 227238 | 12.3% |
| B | 211201 | 11.4% |
| A | 153646 | 8.3% |
| C | 79724 | 4.3% |
| O | 75877 | 4.1% |
| H | 41146 | 2.2% |
| T | 27301 | 1.5% |
| E | 23522 | 1.3% |
| Other values (9) | 34128 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18682075 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4039714 | |
| o | 2550884 | |
| t | 1353417 | 7.2% |
| r | 1171215 | 6.3% |
| s | 1019474 | 5.5% |
| c | 959574 | 5.1% |
| d | 951027 | 5.1% |
| l | 934032 | 5.0% |
| p | 820053 | 4.4% |
| i | 730905 | 3.9% |
| Other values (33) | 4151780 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18682075 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4039714 | |
| o | 2550884 | |
| t | 1353417 | 7.2% |
| r | 1171215 | 6.3% |
| s | 1019474 | 5.5% |
| c | 959574 | 5.1% |
| d | 951027 | 5.1% |
| l | 934032 | 5.0% |
| p | 820053 | 4.4% |
| i | 730905 | 3.9% |
| Other values (33) | 4151780 |
order
Text
Missing 
| Distinct | 464 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 940799 |
| Missing (%) | 48.8% |
| Memory size | 14.7 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 21 |
| Mean length | 10.1311032 |
| Min length | 5 |
Unique
| Unique | 50 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Antipatharia |
|---|---|
| 2nd row | Ophiurida |
| 3rd row | Forcipulatida |
| 4th row | Forcipulatida |
| 5th row | Decapoda |
| Value | Count | Frequency (%) |
| decapoda | 196699 | |
| phyllodocida | 69303 | 7.0% |
| scleractinia | 54206 | 5.5% |
| amphipoda | 49518 | 5.0% |
| isopoda | 28998 | 2.9% |
| terebellida | 28660 | 2.9% |
| unionoida | 28558 | 2.9% |
| eunicida | 25633 | 2.6% |
| ophiurida | 22910 | 2.3% |
| calanoida | 21058 | 2.1% |
| Other values (456) | 459889 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1613568 | |
| o | 1058436 | |
| i | 1000708 | |
| d | 988403 | |
| c | 644706 | 6.5% |
| e | 615446 | 6.2% |
| p | 533792 | 5.3% |
| l | 509511 | 5.1% |
| n | 359765 | 3.6% |
| r | 349430 | 3.5% |
| Other values (40) | 2308026 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8996191 | |
| Uppercase Letter | 985095 | 9.9% |
| Space Separator | 170 | < 0.1% |
| Open Punctuation | 167 | < 0.1% |
| Close Punctuation | 167 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1613568 | |
| o | 1058436 | |
| i | 1000708 | |
| d | 988403 | |
| c | 644706 | 7.2% |
| e | 615446 | 6.8% |
| p | 533792 | 5.9% |
| l | 509511 | 5.7% |
| n | 359765 | 4.0% |
| r | 349430 | 3.9% |
| Other values (14) | 1322426 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 209529 | |
| S | 132542 | |
| P | 130890 | |
| A | 92015 | |
| C | 72395 | 7.3% |
| T | 63419 | 6.4% |
| E | 57025 | 5.8% |
| O | 31777 | 3.2% |
| I | 29448 | 3.0% |
| U | 28573 | 2.9% |
| Other values (12) | 137482 |
Space Separator
| Value | Count | Frequency (%) |
| 170 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 167 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 167 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9981286 | |
| Common | 505 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1613568 | |
| o | 1058436 | |
| i | 1000708 | |
| d | 988403 | |
| c | 644706 | 6.5% |
| e | 615446 | 6.2% |
| p | 533792 | 5.3% |
| l | 509511 | 5.1% |
| n | 359765 | 3.6% |
| r | 349430 | 3.5% |
| Other values (36) | 2307521 |
Common
| Value | Count | Frequency (%) |
| 170 | ||
| [ | 167 | |
| ] | 167 | |
| ? | 1 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9981791 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1613568 | |
| o | 1058436 | |
| i | 1000708 | |
| d | 988403 | |
| c | 644706 | 6.5% |
| e | 615446 | 6.2% |
| p | 533792 | 5.3% |
| l | 509511 | 5.1% |
| n | 359765 | 3.6% |
| r | 349430 | 3.5% |
| Other values (40) | 2308026 |
family
Text
Missing 
| Distinct | 3009 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 191835 |
| Missing (%) | 10.0% |
| Memory size | 14.7 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 11.08729197 |
| Min length | 6 |
Unique
| Unique | 298 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Bullidae |
|---|---|
| 2nd row | Stylopathidae |
| 3rd row | Ophiotrichidae |
| 4th row | Cypraeidae |
| 5th row | Asteriidae |
| Value | Count | Frequency (%) |
| conidae | 38810 | 2.2% |
| cambaridae | 29321 | 1.7% |
| unionidae | 26838 | 1.5% |
| veneridae | 17888 | 1.0% |
| trochidae | 16919 | 1.0% |
| cerithiidae | 16894 | 1.0% |
| cypraeidae | 16831 | 1.0% |
| spionidae | 15844 | 0.9% |
| buccinidae | 15338 | 0.9% |
| syllidae | 14112 | 0.8% |
| Other values (2998) | 1525570 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2890115 | |
| a | 2634149 | |
| e | 2542492 | |
| d | 1977044 | |
| r | 977057 | 5.1% |
| l | 957443 | 5.0% |
| o | 947839 | 4.9% |
| n | 841051 | 4.4% |
| t | 631565 | 3.3% |
| c | 540802 | 2.8% |
| Other values (45) | 4288313 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17493461 | |
| Uppercase Letter | 1734226 | 9.0% |
| Space Separator | 139 | < 0.1% |
| Other Punctuation | 41 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2890115 | |
| a | 2634149 | |
| e | 2542492 | |
| d | 1977044 | |
| r | 977057 | 5.6% |
| l | 957443 | 5.5% |
| o | 947839 | 5.4% |
| n | 841051 | 4.8% |
| t | 631565 | 3.6% |
| c | 540802 | 3.1% |
| Other values (16) | 2553904 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 285220 | |
| P | 255936 | |
| S | 149154 | |
| A | 140208 | 8.1% |
| T | 138389 | 8.0% |
| M | 92451 | 5.3% |
| O | 88221 | 5.1% |
| L | 80443 | 4.6% |
| H | 73149 | 4.2% |
| N | 66069 | 3.8% |
| Other values (15) | 364986 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 28 | |
| ? | 13 |
Space Separator
| Value | Count | Frequency (%) |
| 139 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19227687 | |
| Common | 183 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2890115 | |
| a | 2634149 | |
| e | 2542492 | |
| d | 1977044 | |
| r | 977057 | 5.1% |
| l | 957443 | 5.0% |
| o | 947839 | 4.9% |
| n | 841051 | 4.4% |
| t | 631565 | 3.3% |
| c | 540802 | 2.8% |
| Other values (41) | 4288130 |
Common
| Value | Count | Frequency (%) |
| 139 | ||
| . | 28 | 15.3% |
| ? | 13 | 7.1% |
| + | 3 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19227870 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2890115 | |
| a | 2634149 | |
| e | 2542492 | |
| d | 1977044 | |
| r | 977057 | 5.1% |
| l | 957443 | 5.0% |
| o | 947839 | 4.9% |
| n | 841051 | 4.4% |
| t | 631565 | 3.3% |
| c | 540802 | 2.8% |
| Other values (45) | 4288313 |
subfamily
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 47 57 0 N |
|---|
| Value | Count | Frequency (%) |
| 47 | 1 | |
| 57 | 1 | |
| 0 | 1 | |
| n | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | ||
| 7 | 2 | |
| 4 | 1 | 11.1% |
| 5 | 1 | 11.1% |
| 0 | 1 | 11.1% |
| N | 1 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 | |
| Space Separator | 3 | |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 2 | |
| 4 | 1 | |
| 5 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 | |
| Latin | 1 | 11.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | ||
| 7 | 2 | |
| 4 | 1 | 12.5% |
| 5 | 1 | 12.5% |
| 0 | 1 | 12.5% |
Latin
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | ||
| 7 | 2 | |
| 4 | 1 | 11.1% |
| 5 | 1 | 11.1% |
| 0 | 1 | 11.1% |
| N | 1 | 11.1% |
tribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 129 4 0 W |
|---|
| Value | Count | Frequency (%) |
| 129 | 1 | |
| 4 | 1 | |
| 0 | 1 | |
| w | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | ||
| 1 | 1 | 11.1% |
| 2 | 1 | 11.1% |
| 9 | 1 | 11.1% |
| 4 | 1 | 11.1% |
| 0 | 1 | 11.1% |
| W | 1 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 | |
| Space Separator | 3 | |
| Uppercase Letter | 1 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 9 | 1 | |
| 4 | 1 | |
| 0 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 | |
| Latin | 1 | 11.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | ||
| 1 | 1 | 12.5% |
| 2 | 1 | 12.5% |
| 9 | 1 | 12.5% |
| 4 | 1 | 12.5% |
| 0 | 1 | 12.5% |
Latin
| Value | Count | Frequency (%) |
| W | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | ||
| 1 | 1 | 11.1% |
| 2 | 1 | 11.1% |
| 9 | 1 | 11.1% |
| 4 | 1 | 11.1% |
| 0 | 1 | 11.1% |
| W | 1 | 11.1% |
subtribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Seurat, L. G. |
|---|
| Value | Count | Frequency (%) |
| seurat | 1 | |
| l | 1 | |
| g | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | ||
| . | 2 | |
| S | 1 | |
| e | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 | |
| t | 1 | |
| , | 1 | |
| L | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 | |
| Other Punctuation | 3 | |
| Uppercase Letter | 3 | |
| Space Separator | 2 | 15.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 | |
| t | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| L | 1 | |
| G | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 | |
| , | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 | |
| Common | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1 | |
| e | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 | |
| t | 1 | |
| L | 1 | |
| G | 1 |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| . | 2 | |
| , | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | ||
| . | 2 | |
| S | 1 | |
| e | 1 | |
| u | 1 | |
| r | 1 | |
| a | 1 | |
| t | 1 | |
| , | 1 | |
| L | 1 |
genus
Text
Missing 
| Distinct | 21650 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 353878 |
| Missing (%) | 18.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 23 |
| Mean length | 9.304575867 |
| Min length | 2 |
Unique
| Unique | 4273 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Scypha |
|---|---|
| 2nd row | Bulla |
| 3rd row | Stylopathes |
| 4th row | Ophiothrix |
| 5th row | Cypraea |
| Value | Count | Frequency (%) |
| conus | 24245 | 1.5% |
| cypraea | 15393 | 1.0% |
| cambarus | 10444 | 0.7% |
| cerithium | 9393 | 0.6% |
| orconectes | 8665 | 0.6% |
| procambarus | 8127 | 0.5% |
| nassarius | 6727 | 0.4% |
| lumbrineris | 4966 | 0.3% |
| terebra | 4965 | 0.3% |
| aricidea | 4582 | 0.3% |
| Other values (21641) | 1474698 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1747537 | 11.9% |
| i | 1265907 | 8.7% |
| o | 1157768 | 7.9% |
| e | 1018454 | 7.0% |
| r | 970134 | 6.6% |
| s | 940035 | 6.4% |
| l | 916555 | 6.3% |
| t | 707468 | 4.8% |
| n | 704501 | 4.8% |
| u | 688774 | 4.7% |
| Other values (46) | 4511363 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13056275 | |
| Uppercase Letter | 1572183 | 10.7% |
| Space Separator | 22 | < 0.1% |
| Other Punctuation | 11 | < 0.1% |
| Dash Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1747537 | |
| i | 1265907 | |
| o | 1157768 | 8.9% |
| e | 1018454 | 7.8% |
| r | 970134 | 7.4% |
| s | 940035 | 7.2% |
| l | 916555 | 7.0% |
| t | 707468 | 5.4% |
| n | 704501 | 5.4% |
| u | 688774 | 5.3% |
| Other values (16) | 2939142 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 229739 | |
| P | 219867 | |
| A | 155473 | |
| S | 126285 | 8.0% |
| M | 103197 | 6.6% |
| T | 96867 | 6.2% |
| L | 91233 | 5.8% |
| E | 82418 | 5.2% |
| O | 74602 | 4.7% |
| N | 62734 | 4.0% |
| Other values (16) | 329768 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10 | |
| / | 1 | 9.1% |
Space Separator
| Value | Count | Frequency (%) |
| 22 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14628458 | |
| Common | 38 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1747537 | 11.9% |
| i | 1265907 | 8.7% |
| o | 1157768 | 7.9% |
| e | 1018454 | 7.0% |
| r | 970134 | 6.6% |
| s | 940035 | 6.4% |
| l | 916555 | 6.3% |
| t | 707468 | 4.8% |
| n | 704501 | 4.8% |
| u | 688774 | 4.7% |
| Other values (42) | 4511325 |
Common
| Value | Count | Frequency (%) |
| 22 | ||
| . | 10 | |
| - | 5 | 13.2% |
| / | 1 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14628496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1747537 | 11.9% |
| i | 1265907 | 8.7% |
| o | 1157768 | 7.9% |
| e | 1018454 | 7.0% |
| r | 970134 | 6.6% |
| s | 940035 | 6.4% |
| l | 916555 | 6.3% |
| t | 707468 | 4.8% |
| n | 704501 | 4.8% |
| u | 688774 | 4.7% |
| Other values (46) | 4511363 |
subgenus
Text
Missing 
| Distinct | 2864 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 1813329 |
| Missing (%) | 94.1% |
| Memory size | 14.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 10.25069191 |
| Min length | 3 |
Unique
| Unique | 738 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Ortmannicus |
|---|---|
| 2nd row | Torquis |
| 3rd row | Scolelepis |
| 4th row | Caryophyllia |
| 5th row | Pitarenus |
| Value | Count | Frequency (%) |
| thericium | 3470 | 3.1% |
| depressicambarus | 2960 | 2.6% |
| ortmannicus | 2586 | 2.3% |
| stephanoconus | 2431 | 2.2% |
| cambarus | 1558 | 1.4% |
| canarium | 1428 | 1.3% |
| nebularia | 1392 | 1.2% |
| costellaria | 1392 | 1.2% |
| strigatella | 1335 | 1.2% |
| pennides | 1328 | 1.2% |
| Other values (2854) | 92852 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 138855 | |
| i | 104726 | 9.1% |
| o | 86546 | 7.5% |
| r | 84920 | 7.3% |
| s | 80324 | 7.0% |
| l | 69319 | 6.0% |
| e | 68847 | 6.0% |
| u | 66055 | 5.7% |
| n | 64717 | 5.6% |
| t | 53444 | 4.6% |
| Other values (42) | 337828 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1042849 | |
| Uppercase Letter | 112732 | 9.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 138855 | |
| i | 104726 | |
| o | 86546 | 8.3% |
| r | 84920 | 8.1% |
| s | 80324 | 7.7% |
| l | 69319 | 6.6% |
| e | 68847 | 6.6% |
| u | 66055 | 6.3% |
| n | 64717 | 6.2% |
| t | 53444 | 5.1% |
| Other values (16) | 225096 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 16928 | |
| P | 16842 | |
| A | 10435 | |
| T | 9865 | |
| S | 8909 | 7.9% |
| M | 6567 | 5.8% |
| L | 6453 | 5.7% |
| D | 5642 | 5.0% |
| O | 4368 | 3.9% |
| N | 3515 | 3.1% |
| Other values (16) | 23208 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1155581 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 138855 | |
| i | 104726 | 9.1% |
| o | 86546 | 7.5% |
| r | 84920 | 7.3% |
| s | 80324 | 7.0% |
| l | 69319 | 6.0% |
| e | 68847 | 6.0% |
| u | 66055 | 5.7% |
| n | 64717 | 5.6% |
| t | 53444 | 4.6% |
| Other values (42) | 337828 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1155581 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 138855 | |
| i | 104726 | 9.1% |
| o | 86546 | 7.5% |
| r | 84920 | 7.3% |
| s | 80324 | 7.0% |
| l | 69319 | 6.0% |
| e | 68847 | 6.0% |
| u | 66055 | 5.7% |
| n | 64717 | 5.6% |
| t | 53444 | 4.6% |
| Other values (42) | 337828 |
specificEpithet
Text
Missing 
| Distinct | 46656 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 353916 |
| Missing (%) | 18.4% |
| Memory size | 14.7 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 7.826697919 |
| Min length | 1 |
Unique
| Unique | 13428 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | sp. |
|---|---|
| 2nd row | striata |
| 3rd row | columnaris |
| 4th row | suensonii |
| 5th row | labrolineata |
| Value | Count | Frequency (%) |
| sp | 198016 | 12.6% |
| gracilis | 6359 | 0.4% |
| affinis | 3601 | 0.2% |
| fragilis | 3504 | 0.2% |
| elegans | 3414 | 0.2% |
| aculeata | 3109 | 0.2% |
| borealis | 2990 | 0.2% |
| americanus | 2825 | 0.2% |
| grandis | 2552 | 0.2% |
| tenuis | 2439 | 0.2% |
| Other values (46628) | 1344736 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1648406 | |
| i | 1322391 | |
| s | 1208599 | |
| e | 828620 | 6.7% |
| r | 813081 | 6.6% |
| t | 747136 | 6.1% |
| u | 735526 | 6.0% |
| n | 734569 | 6.0% |
| l | 699734 | 5.7% |
| c | 585047 | 4.8% |
| Other values (36) | 2981595 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12104862 | |
| Other Punctuation | 198291 | 1.6% |
| Space Separator | 1400 | < 0.1% |
| Dash Punctuation | 89 | < 0.1% |
| Decimal Number | 56 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1648406 | |
| i | 1322391 | |
| s | 1208599 | |
| e | 828620 | 6.8% |
| r | 813081 | 6.7% |
| t | 747136 | 6.2% |
| u | 735526 | 6.1% |
| n | 734569 | 6.1% |
| l | 699734 | 5.8% |
| c | 585047 | 4.8% |
| Other values (18) | 2781753 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 198202 | |
| " | 58 | < 0.1% |
| ' | 13 | < 0.1% |
| / | 13 | < 0.1% |
| , | 3 | < 0.1% |
| ? | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 31 | |
| 2 | 18 | |
| 3 | 4 | 7.1% |
| 4 | 1 | 1.8% |
| 5 | 1 | 1.8% |
| 6 | 1 | 1.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 | |
| [ | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 | |
| ] | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1400 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 89 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12104862 | |
| Common | 199842 | 1.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1648406 | |
| i | 1322391 | |
| s | 1208599 | |
| e | 828620 | 6.8% |
| r | 813081 | 6.7% |
| t | 747136 | 6.2% |
| u | 735526 | 6.1% |
| n | 734569 | 6.1% |
| l | 699734 | 5.8% |
| c | 585047 | 4.8% |
| Other values (18) | 2781753 |
Common
| Value | Count | Frequency (%) |
| . | 198202 | |
| 1400 | 0.7% | |
| - | 89 | < 0.1% |
| " | 58 | < 0.1% |
| 1 | 31 | < 0.1% |
| 2 | 18 | < 0.1% |
| ' | 13 | < 0.1% |
| / | 13 | < 0.1% |
| 3 | 4 | < 0.1% |
| , | 3 | < 0.1% |
| Other values (8) | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12304688 | |
| None | 16 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1648406 | |
| i | 1322391 | |
| s | 1208599 | |
| e | 828620 | 6.7% |
| r | 813081 | 6.6% |
| t | 747136 | 6.1% |
| u | 735526 | 6.0% |
| n | 734569 | 6.0% |
| l | 699734 | 5.7% |
| c | 585047 | 4.8% |
| Other values (34) | 2981579 |
None
| Value | Count | Frequency (%) |
| ü | 15 | |
| æ | 1 | 6.2% |
Missing 
| Distinct | 6142 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 1866911 |
| Missing (%) | 96.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 27 |
| Mean length | 8.681639899 |
| Min length | 3 |
Unique
| Unique | 2084 ? |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | tuberculosa |
|---|---|
| 2nd row | imbricata |
| 3rd row | connectens |
| 4th row | laevis |
| 5th row | bonachensis |
| Value | Count | Frequency (%) |
| acutus | 1104 | 1.8% |
| radiata | 638 | 1.1% |
| bartonii | 521 | 0.9% |
| gibbosus | 501 | 0.8% |
| appressa | 444 | 0.7% |
| modicella | 437 | 0.7% |
| rusticus | 389 | 0.6% |
| campanulata | 379 | 0.6% |
| carinata | 372 | 0.6% |
| minor | 370 | 0.6% |
| Other values (6099) | 54802 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 74589 | |
| i | 56561 | |
| s | 47859 | |
| e | 37680 | 7.3% |
| n | 37239 | 7.3% |
| r | 32753 | 6.4% |
| u | 31295 | 6.1% |
| t | 28838 | 5.6% |
| l | 28120 | 5.5% |
| c | 26380 | 5.1% |
| Other values (23) | 112205 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 512368 | |
| Space Separator | 807 | 0.2% |
| Other Punctuation | 332 | 0.1% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 74589 | |
| i | 56561 | |
| s | 47859 | |
| e | 37680 | 7.4% |
| n | 37239 | 7.3% |
| r | 32753 | 6.4% |
| u | 31295 | 6.1% |
| t | 28838 | 5.6% |
| l | 28120 | 5.5% |
| c | 26380 | 5.1% |
| Other values (16) | 111054 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 313 | |
| / | 15 | 4.5% |
| ' | 2 | 0.6% |
| ? | 1 | 0.3% |
| , | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 807 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 512368 | |
| Common | 1151 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 74589 | |
| i | 56561 | |
| s | 47859 | |
| e | 37680 | 7.4% |
| n | 37239 | 7.3% |
| r | 32753 | 6.4% |
| u | 31295 | 6.1% |
| t | 28838 | 5.6% |
| l | 28120 | 5.5% |
| c | 26380 | 5.1% |
| Other values (16) | 111054 |
Common
| Value | Count | Frequency (%) |
| 807 | ||
| . | 313 | 27.2% |
| / | 15 | 1.3% |
| - | 12 | 1.0% |
| ' | 2 | 0.2% |
| ? | 1 | 0.1% |
| , | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 513519 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 74589 | |
| i | 56561 | |
| s | 47859 | |
| e | 37680 | 7.3% |
| n | 37239 | 7.3% |
| r | 32753 | 6.4% |
| u | 31295 | 6.1% |
| t | 28838 | 5.6% |
| l | 28120 | 5.5% |
| c | 26380 | 5.1% |
| Other values (23) | 112205 |
cultivarEpithet
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 1926058 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GEOLocate |
|---|---|
| 2nd row | GEOLocate |
| 3rd row | GEOLocate |
| Value | Count | Frequency (%) |
| geolocate | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 3 | |
| E | 3 | |
| O | 3 | |
| L | 3 | |
| o | 3 | |
| c | 3 | |
| a | 3 | |
| t | 3 | |
| e | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15 | |
| Uppercase Letter | 12 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3 | |
| c | 3 | |
| a | 3 | |
| t | 3 | |
| e | 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 3 | |
| E | 3 | |
| O | 3 | |
| L | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 3 | |
| E | 3 | |
| O | 3 | |
| L | 3 | |
| o | 3 | |
| c | 3 | |
| a | 3 | |
| t | 3 | |
| e | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 3 | |
| E | 3 | |
| O | 3 | |
| L | 3 | |
| o | 3 | |
| c | 3 | |
| a | 3 | |
| t | 3 | |
| e | 3 |
taxonRank
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1866911 |
| Missing (%) | 96.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.999847844 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | subspecies |
|---|---|
| 2nd row | subspecies |
| 3rd row | subspecies |
| 4th row | subspecies |
| 5th row | subspecies |
| Value | Count | Frequency (%) |
| subspecies | 59147 | |
| variety | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 177441 | |
| e | 118297 | |
| i | 59150 | 10.0% |
| u | 59147 | 10.0% |
| b | 59147 | 10.0% |
| p | 59147 | 10.0% |
| c | 59147 | 10.0% |
| V | 3 | < 0.1% |
| a | 3 | < 0.1% |
| r | 3 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 591488 | |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 177441 | |
| e | 118297 | |
| i | 59150 | 10.0% |
| u | 59147 | 10.0% |
| b | 59147 | 10.0% |
| p | 59147 | 10.0% |
| c | 59147 | 10.0% |
| a | 3 | < 0.1% |
| r | 3 | < 0.1% |
| t | 3 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 591491 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 177441 | |
| e | 118297 | |
| i | 59150 | 10.0% |
| u | 59147 | 10.0% |
| b | 59147 | 10.0% |
| p | 59147 | 10.0% |
| c | 59147 | 10.0% |
| V | 3 | < 0.1% |
| a | 3 | < 0.1% |
| r | 3 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 591491 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 177441 | |
| e | 118297 | |
| i | 59150 | 10.0% |
| u | 59147 | 10.0% |
| b | 59147 | 10.0% |
| p | 59147 | 10.0% |
| c | 59147 | 10.0% |
| V | 3 | < 0.1% |
| a | 3 | < 0.1% |
| r | 3 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Missing 
| Distinct | 12117 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 756930 |
| Missing (%) | 39.3% |
| Memory size | 14.7 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 47 |
| Mean length | 8.788540377 |
| Min length | 2 |
Unique
| Unique | 2539 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Bruguière |
|---|---|
| 2nd row | (Duchassaing) |
| 3rd row | Lutken |
| 4th row | Gaokoin |
| 5th row | Fisher |
| Value | Count | Frequency (%) |
| 98247 | 6.8% | |
| linnaeus | 78120 | 5.4% |
| say | 43821 | 3.0% |
| lamarck | 28278 | 1.9% |
| verrill | 22061 | 1.5% |
| stimpson | 21858 | 1.5% |
| gmelin | 20022 | 1.4% |
| dall | 17930 | 1.2% |
| sowerby | 15888 | 1.1% |
| smith | 15824 | 1.1% |
| Other values (7043) | 1091668 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 906924 | 8.8% |
| a | 780692 | 7.6% |
| n | 688330 | 6.7% |
| r | 661943 | 6.4% |
| ( | 616138 | 6.0% |
| ) | 616138 | 6.0% |
| i | 579234 | 5.6% |
| s | 498727 | 4.9% |
| l | 491449 | 4.8% |
| o | 390672 | 3.8% |
| Other values (78) | 4044708 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7284990 | |
| Uppercase Letter | 1336396 | 13.0% |
| Open Punctuation | 616138 | 6.0% |
| Close Punctuation | 616138 | 6.0% |
| Space Separator | 284586 | 2.8% |
| Other Punctuation | 118180 | 1.2% |
| Dash Punctuation | 18527 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 906924 | |
| a | 780692 | |
| n | 688330 | |
| r | 661943 | 9.1% |
| i | 579234 | 8.0% |
| s | 498727 | 6.8% |
| l | 491449 | 6.7% |
| o | 390672 | 5.4% |
| u | 306318 | 4.2% |
| t | 301316 | 4.1% |
| Other values (40) | 1679385 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 176987 | |
| S | 174895 | |
| M | 116679 | 8.7% |
| B | 107118 | 8.0% |
| H | 100415 | 7.5% |
| C | 73945 | 5.5% |
| D | 73228 | 5.5% |
| G | 71058 | 5.3% |
| R | 70187 | 5.3% |
| P | 59533 | 4.5% |
| Other values (20) | 312351 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 98246 | |
| . | 12420 | 10.5% |
| ' | 7462 | 6.3% |
| , | 52 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 616138 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 616138 |
Space Separator
| Value | Count | Frequency (%) |
| 284586 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18527 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8621386 | |
| Common | 1653569 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 906924 | 10.5% |
| a | 780692 | 9.1% |
| n | 688330 | 8.0% |
| r | 661943 | 7.7% |
| i | 579234 | 6.7% |
| s | 498727 | 5.8% |
| l | 491449 | 5.7% |
| o | 390672 | 4.5% |
| u | 306318 | 3.6% |
| t | 301316 | 3.5% |
| Other values (70) | 3015781 |
Common
| Value | Count | Frequency (%) |
| ( | 616138 | |
| ) | 616138 | |
| 284586 | ||
| & | 98246 | 5.9% |
| - | 18527 | 1.1% |
| . | 12420 | 0.8% |
| ' | 7462 | 0.5% |
| , | 52 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10228371 | |
| None | 46584 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 906924 | 8.9% |
| a | 780692 | 7.6% |
| n | 688330 | 6.7% |
| r | 661943 | 6.5% |
| ( | 616138 | 6.0% |
| ) | 616138 | 6.0% |
| i | 579234 | 5.7% |
| s | 498727 | 4.9% |
| l | 491449 | 4.8% |
| o | 390672 | 3.8% |
| Other values (50) | 3998124 |
None
| Value | Count | Frequency (%) |
| ü | 17514 | |
| è | 17194 | |
| é | 4508 | 9.7% |
| ä | 1796 | 3.9% |
| ö | 1657 | 3.6% |
| ø | 1384 | 3.0% |
| å | 620 | 1.3% |
| Ö | 391 | 0.8% |
| á | 269 | 0.6% |
| ñ | 248 | 0.5% |
| Other values (18) | 1003 | 2.2% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926059 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 13 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Van Cleave, H. J. |
|---|---|
| 2nd row | Schwartz, Ben |
| Value | Count | Frequency (%) |
| van | 1 | |
| cleave | 1 | |
| h | 1 | |
| j | 1 | |
| schwartz | 1 | |
| ben | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 13.3% | |
| e | 3 | 10.0% |
| a | 3 | 10.0% |
| . | 2 | 6.7% |
| n | 2 | 6.7% |
| , | 2 | 6.7% |
| c | 1 | 3.3% |
| z | 1 | 3.3% |
| t | 1 | 3.3% |
| r | 1 | 3.3% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 6 | 20.0% |
| Space Separator | 4 | 13.3% |
| Other Punctuation | 4 | 13.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| a | 3 | |
| n | 2 | |
| c | 1 | 6.2% |
| z | 1 | 6.2% |
| t | 1 | 6.2% |
| r | 1 | 6.2% |
| w | 1 | 6.2% |
| h | 1 | 6.2% |
| v | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 1 | |
| S | 1 | |
| J | 1 | |
| H | 1 | |
| C | 1 | |
| B | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 | |
| , | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22 | |
| Common | 8 | 26.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| a | 3 | |
| n | 2 | 9.1% |
| c | 1 | 4.5% |
| z | 1 | 4.5% |
| t | 1 | 4.5% |
| r | 1 | 4.5% |
| w | 1 | 4.5% |
| h | 1 | 4.5% |
| V | 1 | 4.5% |
| Other values (7) | 7 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| . | 2 | |
| , | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 13.3% | |
| e | 3 | 10.0% |
| a | 3 | 10.0% |
| . | 2 | 6.7% |
| n | 2 | 6.7% |
| , | 2 | 6.7% |
| c | 1 | 3.3% |
| z | 1 | 3.3% |
| t | 1 | 3.3% |
| r | 1 | 3.3% |
| Other values (10) | 10 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1926060 |
| Missing (%) | > 99.9% |
| Memory size | 14.7 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Camallanus seurati |
|---|
| Value | Count | Frequency (%) |
| camallanus | 1 | |
| seurati | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 2 | |
| u | 2 | |
| s | 2 | |
| C | 1 | 5.6% |
| m | 1 | 5.6% |
| n | 1 | 5.6% |
| 1 | 5.6% | |
| e | 1 | 5.6% |
| r | 1 | 5.6% |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 1 | 5.6% |
| Space Separator | 1 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 2 | |
| u | 2 | |
| s | 2 | |
| m | 1 | 6.2% |
| n | 1 | 6.2% |
| e | 1 | 6.2% |
| r | 1 | 6.2% |
| t | 1 | 6.2% |
| i | 1 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17 | |
| Common | 1 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 2 | |
| u | 2 | |
| s | 2 | |
| C | 1 | 5.9% |
| m | 1 | 5.9% |
| n | 1 | 5.9% |
| e | 1 | 5.9% |
| r | 1 | 5.9% |
| t | 1 | 5.9% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| l | 2 | |
| u | 2 | |
| s | 2 | |
| C | 1 | 5.6% |
| m | 1 | 5.6% |
| n | 1 | 5.6% |
| 1 | 5.6% | |
| e | 1 | 5.6% |
| r | 1 | 5.6% |
| Other values (2) | 2 |